This Page Is Inserted by IFW Operations 
and is not a part of the Official Record 



BEST AVAILABLE IMAGES 



Defective images within this document are accurate representations of 
the original documents submitted by the applicant. 

Defects in the images may include (but are not limited to): 



BLACK BORDERS 

TEXT CUT OFF AT TOP, BOTTOM OR SIDES 
FADED TEXT 
ILLEGIBLE TEXT 
SKEWED/SLANTED IMAGES 
COLORED PHOTOS 

BLACK OR VERY BLACK AND WHITE DARK PHOTOS 
GRAY SCALE DOCUMENTS 



IMAGES ARE BEST AVAILABLE COPY. 



As rescanning documents will not correct images, 
please do not report the images to the 
Image Problem Mailbox. 



THIS PAGE BLANK (uspto) 



WORLD PSTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



{51) International Patent Classification 6 : 

COIN 33/569, C07K 14/44, CI2N 15/12, 
15/85, 1/21, A61K 39/005, G01N 33/543, 
A61K 39/002 > 39/008 



Al 



(11) International Publication Number: 
(43) International Publication Date: 



WO 97/18475 

22 May 1997 (22.05.97) 



(21) Internationa] Application Number: PCDUS96V 18624 

(21) International Filing Date: 14 November 1996 (14.11.96) 



(30) Priority Data: 

08/557,309 



14 November 1995 (I4.il. 95) US 



(71) Applicant: CORiXA CORPORATION (US/US}. Suite 464. 

1 124 Columbia Street. Seattle, WA 98104 (US). 

(72) Inventors: REED, Steven, G.; 2843 ~ 122nd Place N.E., 

Beilevue, WA 98005 (US). SKEIKY, Yasir, A., W.; 8327 
- 25th Avenue N.W., Seattle, WA 98117 (US). t,ODES, 
Michael. J.; 9223 - 36th Avenue S.W., Seattle, WA 98126 
(US). HOUGHTON, Ravmond. L.; 2636 - 242nd Place S.E., 
Bothelt, WA 98021 (US). 

(74) Agents; MAKI, David, J. et al.; Seed and Berry LJ..P., 6300 
Columbia Center, 701 Fifth Averme, Seattle, WA 98104- 
7092 (US). 



(81) Designated States; AL, AM, AT, AU, BA. BB, BG. BR, BY, 
CA, CH. CN, CU, CZ. DE, DK, EE, ES, FI, GB, GE, HU, 
IU IS, JP, KE, KG, KP, KR, KZ, IX, LK, LR, LS. LT, LU, 
LV, MD, MG, MK, MN, MW, MX, NO, NZ, PL, PT, RO, 
RU, SD, SE, SG, SI, SK, TJ, TM, TR, TT, UA, UG ( UZ T 
VN. ARIPO patent (KE, LS. MW. SD, SZ, UG). Eurasian 
patent (AM, AZ, BY t KG, KZ, MD, RU, TJ, TM), European 
patent {AT, BE, CH, DE, DK, ES, FI, FR, GB, GR, IE, IT, 
LU, MC, NL, PT, SE). OAPI patent (BF, BJ. CF. CG, CI, 
CM, GA, GN, ML. MR, NK, SN, TD, TG). 



Published 

With international search report. 



(54) Title: COMPOUNDS AND METHODS FOR THE DETECTION AND PREVENTION OF T. CRUZ1 INFECTION 



(57) Abstract 



Compounds and methods are provided for diagnosing Trypanosoma cruzi infection. The disclosed compounds are polypeptides, 
or antibodies thereto, that contain one or more epitopes of T. cruzi antigens. The compounds are useful in a variety of immunoassays 
for detecting T. cruzi infection. The polypeptide compounds are further useful in vaccines and pharmaceutical compositions for inducing 
protective immunity against Chagas' disease in Individuals exposed to T. cruzi. 



Printed from Mimosa 01/07/09 1 1 :43:36 Page: 1 



t 




FOR THE PURPOSE OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international 
applications under the PCT. 



AM 


Armenia 


i:b 


United Kingdom 


MW 


Malawi 


AT 


Austria 


GE 


Georgia 


MX 


Mex ico 


All 


Australia 


GN 


Guinea 


NE 


Niper 


BB 


Barbados 


CR 


Greece 


NL 


Netherlands 


Bfc 


Belgium 


HI) 


Hungary 


NO 


Norway 


BF 


Burkina Fa*o 


IB 


Ireland 


NZ 


New 7fa1and 


Br, 


Bulgaria 


IT 


Italy 


PL 


Poland 


BJ 


Benin 


JP 


Japan 


PT 


Portugal 


BR 


Brazil 


KK 


Kenya 


KU 


Romania 


BY 


He Unit 


KG 


Kyrgystai) 


KU 


Rum ian Federation 


CA 


Canada 


KP 


Democratic People's Republic 


SD 


Sudan 


CF 


Central African Republic 




of Korca 


SB 


Sweden 


CC 


Congo 


KR 


Republic of Krvea 


SG 


Singapore 


CH 


Switzerland 


KZ 


Kazakhstan 


Si 


Slovenia 


CC 


C6tc d'Woir* 


LI 


Liec bieniic In 


SK 


Slovakia 


CM 


Cameroon 


LK 


Sri Ijuilci 


SN 


Senegal 


CN 


China 


LR 


Liberia 


sz 


Swaziland 


cs 


Czechoslovakia 


LT 


Lil (mania 


TD 


Chad 


cz 


Oxen Republic 


LL 


Luxembourg 


t<; 


Togo 


l>tt 


Germany 


LV 


Latvia 


TJ 


Tajikiiian 


DK 


Denmark 


MC 


Monaco 


tt 


Trinidad and Tobago 


KB 


Estonia 


MD 


Republic of Moldova 


LA 


Ukraine 


ES 


Spain 


mg 


Madagascar 


VG 


Uganda 


Fl 


Finland 


Ml, 


Mali 


US 


I J tilled Slates of America 


FR 


France 


MN 


Mongolia 


UZ 


Cxbeklstan 


GA 


Gabon 


MR 


Mauritania 


VN 


Viet Nam 



Printed from Mimosa 01/07/09 1 1:43:37 Page: 2 



WO £7/18475 PCT/US5W18624 



Description 

COMPOUNDS AND METHODS FOR THE DETECTION 
AND PREVENTION OF T. CJgfTZ/INFFCTfON 

5 

Technical Field 

The present invention relates generally to the diagnosis of T. cruzi 
infection. The invention is more particularly related to the use of one or more 1\ cruzi 
antigenic peptides, or antibodies thereto, in methods and diagnostic kits to screen 
10 individuals and blood supplies for T. cruzt infection. The invention is also directed to 
vaccine compositions for immunizing an individual to prevent Chagas* disease. 

Background of the Invention 

Protozoan parasites are a serious health threat in many areas of the 

15 world. Trypanosoma cruzi (T. cruzi) is one such parasite that infects millions of 
individuals, primarily in Central and South America. Infections with this parasite can 
cause Chagas' disease, which may result in chronic heart disease and a variety of 
immune system disorders. It is estimated that 18 million people in Latin America are 
infected with 7. cruzu but there is no reliable treatment for the clinical manifestations 

20 of infection. No vaccine for the prevention of Chagas' disease is currently available. 

The most significant route of transmission in areas where the disease is 
endemic is through contact with an infected triatomid bug. In other areas, however, 
Wood transfusions are the dominant means of transmission. To inhibit the transmission 
of 7*. cruzi in such regions, it is necessary to develop accurate methods for diagnosing 

25 T, cruzi infection in individuals and for screening blood supplies. Blood bank 
screening is particularly important in South America, where 0.1%-62% of samples may 
be infected and where the parasite is frequently transmitted by blood transfusion. There 
is also increasing concern that the blood supply in certain U.S. cities may be 
contaminated with T. cruzi parasites. 

30 The diagnosis of T. cruzi infection has been problematic, since accurate 

methods for detecting the parasite that are suitable for routine use have been 
unavailable. During the acute phase of infection, which may last for decades, the 
infection may remain quiescent and the host may be asymptomatic. As a result, 
serological tests for T. cruzi infection are the most reliable and the most commonly 

35 used. 

Such diagnoses are complicated, however, by the complex life cycle of 
the parasite and the diverse immune responses of the host. The parasite passes through 
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an epimastigote stage in the insect vector and two main stages in the mammalian host. 
One host stage is present in blood (the trypomastigote stage) and a second stage is 
intracellular (the amastigote stage). The multiple stages result in a diversity of antigens 
presented by the parasite during infection. In addition, immune responses to protozoan 
5 infection are complex, involving both humoral and cell-mediated responses to the array 
of parasite antigens. 

While detecting antibodies against parasite antigens is the most common 
and reliable method of diagnosing clinical and subclinical infections, current tests are 
expensive and difficult Most serological tests use whole or lysed T. cruzi and require 

10 positive results on two of three tests, including complement fixation, indirect 
immunofluorescence, passive agglutination or ELISA, to accurately detect T t cruzi 
infection. The cost and difficulty of such tests has prevented the screening of blood or 
sera in many endemic areas. 

Accordingly, there is a need in the art for more specific and sensitive 

15 methods of detecting T. cruzi infections in blood supplies and individuals. The present 
invention fulfills these needs and further provides other related advantages. 



Summary of the Invention 

Briefly stated, this invention provides compounds and methods for 

20 detecting and protecting against T. cruzi infection in individuals and in blood supplies, 
and for screening for T. cruzi infection in biological samples. In one aspect, the present 
invention provides methods for detecting T. cruzi infection in a biological sample, 
comprising (a) contacting the biological sample with a polypeptide comprising an 
epitope of a T. cruzi antigen having an amino acid sequence encoded by a nucleotide 

25 sequence recited in SEQ ID NO:l - SEQ ID NO:22, or a variant of such an antigen that 
differs only in conservative substitutions and/or modifications; and (b) detecting in the 
biological sample the presence of antibodies that bind to the polypeptide, therefrom 
detecting T. cruzi infection in the biological sample. 

In another aspect of this invention, polypeptides are provided 

30 comprising an epitope of a 7*. cruzi antigen having an amino acid sequence encoded by 
a nucleotide sequence recited in SEQ ID NO:l - SEQ ID NO:21, or a variant of such an 
antigen that differs only in conservative substitutions and/or modifications. 

Within related aspects, DNA sequences encoding the above 
polypeptides, expression vectors comprising these DNA sequences and host cells 

35 transformed or transfected with such expression vectors are also provided. 

In another aspect, the present invention provides diagnostic kits for 
detecting 71 cruzi infection in a biological sample, comprising (a) a polypeptide 
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comprising an epitope of a T. cruzi antigen having an amino acid sequence encoded by 
a nucleotide sequence recited in SEQ ID NO: 1 - SEQ ID NO:22, or a variant of such an 
antigen that differs only in conservative substitutions and/or modifications; and (b) a 
detection reagent. 

5 In yet another aspect of the invention, methods for detecting the 

presence of T. cruzi infection in a biological sample are provided, comprising (a) 
contacting a biological sample with a monoclonal antibody that binds to an epitope of a 
r. cruzi antigen having an amino acid sequence encoded by a nucleotide sequence 
recited in SEQ ID NO:l - SEQ ID NO;22, or a variant of such an antigen that differs 
10 only in conservative substitutions and/or modifications; and (b) detecting in the 
biological sample the presence of T. cruzi parasites that bind to the monoclonal 
antibody. 

Within related aspects, pharmaceutical compositions comprising the 
above polypeptides and a physiologically acceptable carrier, and vaccines comprising 

15 the above polypeptides in combination with an adjuvant, are also provided. 

The present invention also provides, within other aspects, methods for 
inducing protective immunity against Chagas' disease in a patient, comprising 
administering to a patient a pharmaceutical composition or vaccine as described above. 

Within other aspects, the present invention provides methods for 

20 detecting T. cruzi infection in a biological sample, comprising (a) contacting the 
biological sample with a first polypeptide comprising an epitope of a T. cruzi aniigen 
having an amino acid sequence encoded by a nucleotide sequence recited in SEQ ID 
NO:l - SEQ ID NO;22, or a variant of said antigen that differs only in conservative 
substitutions and/or modifications; (b) contacting the biological sample with one or 

25 more additional polypeptides comprising one or more epitopes of other T. cruzi 
antigens, or a variant thereof that differs only in conservative substitutions and/or 
modifications; and (c) detecting in the biological sample the presence of antibodies that 
bind to one or more of said polypeptides, therefrom detecting T. cruzi infection in the 
biological sample. In one embodiment, the additional polypeptide comprises an epitope 

30 of TcD, or a variant thereof that differs only in conservative substitutions and/or 
modifications. In another embodiment, the additional polypeptides comprise an epitope 
of TcD (or a variant thereof that differs only in conservative substitutions and/or 
modifications) and an epitope of TcE (or a variant thereof that differs only in 
conservative substitutions and/or modifications). In yet another embodiment, the 

35 additional polypeptides comprise an epitope of TcD (or a variant thereof that differs 
only in conservative substitutions and/or modifications) and PEP-2 (or a variant thereof 
that differs only in conservative substitutions and/or modifications). 
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In yet further aspects, the present invention provides combination 
polypeptides comprising two or more polypeptides, each polypeptide comprising an 
epitope of a T. cruzi antigen having an amino acid sequence encoded by a nucleotide 
sequence recited in SEQ ID NO:l - SEQ ID NO:22, or a variant thereof that differs 
5 only in conservative substitutions and/or modifications. Combination polypeptides 
comprising at least one epitope of a T. cruzi antigen having an amino acid sequence 
encoded by a nucleotide sequence recited in SEQ ID NO: I - SEQ ID NO:22, or a 
variant thereof that differs only in conservative substitutions and/or modifications, and 
at least one epitope selected from the group consisting of TcD epitopes, TcE epitopes, 
1 0 PEP-2 epitopes and variants thereof that differ only in conservative substitutions and/or 
modifications are also provided. 

In related aspects, methods are provided for detecting T. cruzi infection 
in a biological sample, comprising (a) contacting the biological sample with at least one 
of the above combination polypeptides and (b) detecting in the biological sample the 
IS presence of antibodies that bind to the combination polypeptide. 

These and other aspects of the present invention will become apparent 
upon reference to the following detailed description and attached drawings. AH 
references disclosed herein are hereby incorporated by reference in their entirety as if 
each was incorporated individually. 

20 

Brief Description of the Drawin&s 

Figure 1 is a graph comparing the reactivity of T. cruzi lysate and a 
representative polypeptide of the present invention (rTcc6) in an ELISA assay 
performed using sera from T. crwz/-infected (Pos) and uninfected (Neg) individuals. 

25 The bars represent + 1 standard deviation. 

Figure 2 is a graph presenting a comparison of the reactivity of 
representative polypeptides of the subject invention in an ELISA assay performed using 
sera from T. cruzi-infected (Pos) and uninfected (Ncg) individuals. Experiment 1 
shows a comparison of rTcc22 and the peptides Tcc22-1 and Tcc22-1+; Experiment 2 

30 shows a comparison of rTcc22 t rTcHil2 and the peptides Tcc22-1, Tcc22-1 + and 
Tcc22-2. 1 . The bars represent + 1 standard deviation. 

Figure 3 is a graph depicting a comparison of the reactivity of T. cruzi 
lysate and a representative polypeptide (Tcc38) in an ELISA assay performed using 
sera from 7! crwzi-infectcd (Pos) and uninfected (Neg) individuals, as well as using sera 

35 from individuals with visceral leishmaniasis (VL), cutaneous leishmaniasis (CL), 
tuberculosis (TB) and malaria. The bars represent + 1 standard deviation. 
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5 

Figure 4 is a graph presenting a comparison of the reactivity of T. cruzi 
lysate and several polypeptides of the present invention, representing different reading 
frames of the TcLol and TcHilO antigens, in an ELISA assay performed using sera 
from T. crwzr-infected (Pos) and uninfected (Neg) individuals. The hars represent ± 1 
5 standard deviation. 

Figure 5 is a graph comparing the reactivity of 7*. cruzi lysate and a 
representative polypeptide (Tcclxl.2) in an ELISA assay performed using sera from T. 
cruzi-infected (Pos) and uninfected (Neg) individuals, as well as sera from individuals 
with viscera] leishmaniasis (VL), cutaneous leishmaniasis (CL), malaria and 
10 tuberculosis (TB). 

Figure 6 is a graph depicting the ELISA reactivity of a series of 
polypeptide combinations with T. cruzi positive and negative sera. 

Figure 7 is a graph presenting the ELISA reactivity of a series of TeE 
polypeptide variants with T. cruzi positive and negative sera. 
15 Figure 8 is a graph comparing the ELISA reactivity of two dipeptides, a 

tripeptide and a tetrapeptide of the present invention with T> cruzi positive and negative 
sera. 

Figure 9 is a graph presenting the ELISA reactivity of a representative 
polypeptide of the present invention (TcHi29) and of TcE with sera from normal 
20 individuals, T. cruzi patients, and patients with other diseases. 

Figure 10 is a graph comparing the ELISA reactivity of two 
representative dipeptide mixtures with 7. cruzi positive and negative sera, one mixture 
including a TcE epitope and the other including aTcIH29 epitope of the present 
invention. 

25 

Detailed Description of the Invention 

As noted above, the present invention is generally directed to 
compounds and methods for detecting and protecting against T. cruzi infection in 
individuals and in blood supplies. The compounds of this invention generally comprise 
one or more epitopes of 7! cruzi antigens. In particular, polypeptides comprising an 
epitope of a T. cruzi antigen having an amino acid sequence encoded by a nucleotide 
sequence recited in SEQ ID NO:l - SEQ ID NO:22are preferred. As used herein, the 
term "polypeptide" encompasses amino acid chains of any length, including full length 
(te„ native) antigens. Thus, a polypeptide comprising an epitope may consist entirely 
of the epitope or may contain additional sequences. The additional sequences may be 
derived from the native antigen or may be heterologous, and such sequences may (but 



30 



35 
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need not) be antigenic. A protein "having" a particular amino acid sequence is a 
protein that contains, within its full length sequence, the recited sequence. Such a 
protein may, or may not, contain additional amino acid sequence. The use of one or 
more epitopes from additional T. cruzi proteins, prior to or in combination with one or 
5 more epitopes of sequences recited herein, to enhance the sensitivity and specificity of 
the diagnosis, is also contemplated. 

An "epitope," as used herein, is a portion of a 7! cruzi antigen that reacts 
with sera from T. crwzi-infected individuals (/.e., an epitope is specifically bound by 
one or more antibodies within such sera). Epitopes of the antigens described in the 

10 present application may generally be identified using methods known to those of 
ordinary skill in the art, such as those summarized in Paul, Fundamental Immunology, 
3rd ed., 243-247 (Raven Press, 1993) and references cited therein. For example, a 
polypeptide derived from a native T. cruzi antigen may be screened for the ability to 
react with pooled sera obtained from T. crwz/-infected patients. Suitable assays for 

15 evaluating reactivity with T. cruzMnfected sera, such as an enzyme linked 
immunosorbent assay [EL1SA), are described in more detail below, and in Harlow and 
Lane, Antibodies: A Laboratory Manual Cold Spring Harbor Laboratory, 1988. An 
epitope of a polypeptide is a portion that reacts with such antisera at a level that is 
substantially similar to the reactivity of the full length polypeptide. In other words, an 

20 epitope may generate at least about 80%, and preferably at least about 100%, of the 
response generated by the full length polypeptide in an antibody binding assay (e.g., an 
ELISA). 

The compounds and methods of this invention also encompass variants 
of the above polypeptides. As used herein, a "variant" is a polypeptide that differs from 

25 the recited polypeptide only in conservative substitutions or modifications, such that it 
retains the antigenic properties of the recited polypeptide. A "conservative 
substitution" is one in which an amino acid is substituted for another amino acid that 
has similar properties, such that one skilled in the art of peptide chemistry would expect 
the secondary structure and hydropathic nature of the polypeptide to be substantially 

30 unchanged. In general, the following groups of amino acids represent conservative 
changes: (1) aia, pro, gly, glu, asp, gin, asn, ser, thr; (2) cys, ser, lyr, thr; (3) val, ile, 
leu, met, ala, phe; (4) )ys, arg, his; and (5) phe, tyr, trp, his. Variants may also, or 
alternatively, contain other conservative modifications, including the deletion or 
addition of amino acids that have minimal influence on the antigenic properties, 

35 secondary structure and hydropathic nature of the polypeptide. For example, the 
polypeptide may be conjugated to a linker or other sequence for ease of synthesis or to 
enhance binding of the polypeptide to a solid support. 
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In a related aspect, combination polypeptides comprising epitopes of 
multiple T. cruzi antigens arc disclosed. A "combination polypeptide" is a polypeptide 
in which epitopes of different T. cruzi antigens, or variants thereof, are joined, for 
example through a peptide linkage, into a single amino acid chain. The amino acid 
5 chain thus formed may be either linear or branched. The epitopes may be joined 
directly (i.e., with no intervening amino acids) or may be joined by way of a linker 
sequence (e.g., Gly-Cys-Gly) that does not significantly alter the antigenic properties of 
the epitopes. The peptide epitopes may also be linked through non-peptide linkages, 
such as hetero- or homo-bifunctional agents that chemically or photochemically couple 

10 between specific functional groups on the peptide epitopes such as through amino, 
carboxyl, or sulfhydryl groups. B afunctional agents which may be usefully employed 
in the combination polypeptides of the present invention are well known to those of 
skill in the art. Epitopes may also be linked by means of a complementary ligand/anti- 
ligand pair, such as avidin/biotin, with one or more epitopes being linked to a first 

15 member of the ligand/anti-ligand pair and then being bound to the complementary 
member of the ligand/anti-ligand pair either in solution or in solid phase. A 
combination polypeptide may contain multiple epitopes of polypeptides as described 
herein and/or may contain epitopes of one or more other T. cruzi antigens, such as TcD, 
TcE or PEP-2, linked to an epitope described herein. 

20 In general, T. cruzi antigens, and DNA sequences encoding such 

antigens, may be prepared using any of a variety of procedures. For example, a T. cruzi 
cDNA or genomic DNA expression library may be screened with pools of sera from T. 
*rraz/-infected individuals. Such screens may generally be performed using techniques 
well known to those of ordinary skill in the art, such as those described in Sambrook 

25 et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratories, 
Cold Spring Harbor, N.Y., 1989. Briefly, the bacteriophage library may be plated and 
transferred to filters. The filters may then be incubated with serum and a detection 
reagent. In the context of this invention, a "detection reagent" is any compound capable 
of binding to the antibody-antigen complex, which may then be detected by any of a 

30 variety of means known to those of ordinary skill in the art. Typical detection reagents 
for screening purposes contain a "binding agent, 41 such as Protein A, Protein G, IgG or a 
lectin, coupled to a reporter group. Preferred reporter groups include, but are not 
limited to, enzymes, substrates, cofactors, inhibitors, dyes, radionuclides, luminescent 
groups, fluorescent groups and biotin. More preferably, the reporter group is 

35 horseradish peroxidase, which may be detected by incubation with a substrate such as 
tetramethylbenzidine or 2,2'-azino-diO-ethylbenztbiazoline sulfonic acid. Plaques 
containing cDN As that express a protein that binds to an antibody in the serum may be 
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isolated and purified by techniques known to those of ordinary skill in the art. 
Appropriate methods may be found, for example, in Sarnbrook ct al., Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratories, Cold Spring Harbor, 
N.Y., 1989. 

5 DNA molecules having the nucleotide sequences recited in SEQ ID 

NO:l - SEQ ID NO: 18 may be isolated by screening a T. cruzi genomic expression 
library with pools of sera from T. crwzi-infected individuals, as described above. More 
specifically, DNA molecules having the nucleotide sequences recited in SEQ ID NO:l - 
SEQ ID NO: 16 may be isolated by screening the library with a pool of sera that 

10 displays serological reactivity (in an ELISA or Western assay) with parasite lysate 
and/or one or both of the 7*. cruzi antigens TcD and TcE, described in U.S. Patent No. 
5,304,371 and U.S. Serial No. 08/403,379, filed March 14, 1995. A subsequent screen 
is then performed with patient sera lacking detectable anti-TcD antibody. A DNA 
molecule having the nucleotide sequences recited in SEQ ID NO: 17 (5* end) and SEQ 

15 ID NO: 18 (3' end) may be isolated by screening the genomic expression library with a 
pool of sera that displays lower serological reactivity (i.e., detects a signal less than 3 
standard deviations over background reactivity in an ELISA or Western assay) with 
lysate, TcD and TcE, followed by a subsequent screen with patient sera lacking 
detectable anti-TcD antibody. 

20 DNA molecules having the sequences recited in SEQ ID NO: 19 - SEQ 

ID NO:22 may be obtained by screening an unamplified T. cruzi cDNA expression 
library with sera (both higher and lower serological reactivity) from 71 cruz/-infected 
individuals, as described above. 

Alternatively , DNA molecules having the sequences recited in SEQ ID 

25 NO:l - SEQ ID NO:22 may be amplified from T. cruzi genomic DNA or cDNA via 
polymerase chain reaction. For this approach, sequence-specific primers may be 
designed based on the sequences provided in SEQ ID NO: I - SEQ ID NO:22, and may 
be purchased or synthesized. An amplified portion of the DNA sequences may then be 
used to isolate the full length genomic or cDNA clones using well known techniques, 

30 such as those described in Sarnbrook et al., Molecular Cloning: A Laboratory Manual, 
Cold Spring Harbor Laboratories, Cold Spring Harbor, NY (1989). 

Epitopes of antigens having amino acid sequences encoded by the above 
DNA sequences may generally be identified by generating polypeptides containing 
portions of the native antigen and evaluating the reactivity of the polypeptides with sera 

35 from r. crwzi-infected individuals, as described above. In many instances, peptides 
comprising one or more repeat sequences found in the native antigen contain an 
epitope. Such repeat sequences may be identified based on inspection of the above 
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nucleotide sequences. Representative repeat sequences for antigens encoded by the 
above DNA sequences are provided in SEQ ID NO:23 - SEQ ID NO;36 and SEQ ID 
NO:47 - SEQ ID NO:49. More specifically, Tepeat sequences for the sequence recited 
in SEQ ID NO:3 are provided in SEQ ID NO:23 (Frame 1), SEQ ID NO:24 (Frame 2) 
5 and SEQ ID NO:25 (Frame 3). Repeat sequences for the sequence recited in SEQ ID 
NO:4 are provided in SEQ ID NO:26 (Frame t) and SEQ ID NO:27 (Frame 3) and 
repeat sequences for SEQ ID NO:9 are provided in SEQ ID NO:47 (Frame 1), SEQ ID 
NO:48 (Frame 2) and SEQ ID NO:49 (Frame 3). For SEQ ID NO: 12, repeat sequences 
are provided in SEQ ID NO:28 (Frame 1), SEQ ID NO:29 (Frame 2) and SEQ ID 

10 NO:30 (Frame 3). SEQ ID NO:31 recites a repeat sequence for SEQ ID NO:l 5. For 
SEQ ID NO: 16, repeat sequences are provided in SEQ ID NO:32 (Frame 2) and SEQ 
ID NO:33 (Frame 3). Finally, repeat sequences for SEQ ID NO: 18 are provided in 
SEQ ID NO:34 (Frame !), SEQ ID NO:35 (Frame 2) and SEQ ID NO:36 (Frame 3). 

The polypeptides described herein may be generated using techniques 

15 well known to those of ordinary skill in the art. Polypeptides having fewer than about 
100 amino acids, and generally fewer than about 50 amino acids, can be synthesized 
using, for example* the Merrifield solid-phase synthesis method, where amino acids are 
sequentially added to a growing amino acid chain. See Merrifield, J. Am. Chem. Soc. 
35:2149-2146, 1963. Equipment for automated synthesis of polypeptides is 

20 commercially available from suppliers such as Applied Biosystems, Inc., Foster City, 
CA. Thus, for example, polypeptides comprising the above repeat sequences or 
portions thereof, may be synthesized by this method. Similarly, epitopes of other 
native antigens, or variants thereof may be prepared using an automated synthesizer. 

Alternatively, the polypeptides of this invention may be prepared by 

25 expression of recombinant DNA encoding the polypeptide in cultured host cells. 
Preferably, the host cells are E. coli, yeast, an insect cell line (such as Spodoptera or 
Trichoplusia) or a mammalian cell line, including (but not limited to) CHO, COS and 
NS-1. The DNA sequences expressed in this manner may encode naturally occurring 
proteins, such as full length antigens having the amino acid sequences encoded by the 

30 DNA sequences of SEQ ID NO:l - SEQ ID NO:22, portions of naturally occurring 
proteins, or variants of such proteins. Representative polypeptides encoded by such 
DNA sequences are provided in SEQ ID NO:37 - SEQ ID NO:46, SEQ ID NO:52, and 
SEQIDNO:65. 

Expressed polypeptides of this invention are generally isolated in 
35 substantially pure form. Preferably, the polypeptides are isolated to a purity of at least 
80% by weight, more preferably, to a purity of at least 95% by weight, and most 
preferably to a purity of at least 99% by weight. In general, such purification may be 
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achieved using, for example, the standard techniques of ammonium sulfate 
fractionation, SDS-PAGE electrophoresis, and affinity chromatography. 

In another aspect of this invention, methods for detecting T. cruzi 
infection in individuals and Wood supplies are disclosed. In one embodiment, T. cruzi 
5 infection may be detected in any biological sample that contains antibodies. Preferably, 
the sample is blood, serum, plasma, saliva, cerebrospinal fluid or urine. More 
preferably, the sample is a blood or serum sample obtained from a patient or a blood 
supply. Briefly, 7* cruzi infection may be delected using any one or more of the 
polypeptides described above, or variants thereof, to determine the presence or absence 

10 of antibodies to the polypeptide or polypeptides in the sample, relative to a 
predetermined cut-off value. 

There are a variety of assay formats known to those of ordinary skill in 
the art for using purified antigen to detect antibodies in a sample. See. e.g., Harlow and 
Lane, Antibodies: A Laboratory Manual Cold Spring Harbor Laboratory, 1988. In a 

1 5 preferred embodiment, the assay involves the use of polypeptide immobilized on a solid 
support to bind to and remove the antibody from the sample. The bound antibody may 
then be detected using a detection reagent that binds to the antibody/peptide complex 
and contains a detectable reporter group. Suitable detection reagents include antibodies 
that bind to the antibody/polypeptide complex and free polypeptide labeled with a 

20 reporter group (e.g., in a semi-competitive assay). Alternatively, a competitive assay 
may be utilized, in which an antibody that binds to the polypeptide is labeled with a 
reporter group and allowed to bind to the immobilized antigen after incubation of the 
antigen with the sample. The extent to which components of the sample inhibit the 
binding of the labeled antibody to the polypeptide is indicative of the reactivity of the 

25 sample with the immobilized polypeptide. 

The solid support may be any solid material known to those of ordinary 
skill in the art to which the antigen may be attached. For example, the solid support 
may be a test well in a microliter plate or a nitrocellulose or other suitable membrane. 
Alternatively, the support may be a bead or disc, such as glass, fiberglass, latex or a 

30 plastic material such as polystyrene or polyvinylchloridc. The support may also be a 
magnetic particle or a fiber optic sensor, such as those disclosed, for example, in U.S. 
Patent No. 5,359,681. 

The polypeptide may be bound to the solid support using a variety of 
techniques known to those in the art, which are amply described in the patent and 

35 scientific literature. In the context of the present invention, the term "bound" refers to 
both noncovalent association, such as adsorption, and covalent attachment (which may 
be a direct linkage between the antigen and functional groups on the support or may be 
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a linkage by way of a cross-linking agent). Binding by adsorption to a well in a 
microtiter plate or to a membrane is preferred. In such cases, adsorption may be 
achieved by contacting the polypeptide, in a suitable buffer, with the solid support for a 
suitable amounl of time. The contact time varies with temperature, but is typically 
5 between about 1 hour and 1 day. In generah contacting a well of a plastic miemtiter 
plate (such as polystyrene or polyvinylchloride) with an amount of polypeptide ranging 
from about 10 ng to about 1 jig, and preferably about 100 ng, is sufficient to bind an 
adequate amount of antigen. Nitrocellulose will bind approximately 100 jig of protein 
per cm\ 

10 Covalent attachment of polypeptide to a solid support may generally be 

achieved by first reacting the support with a Afunctional reagent that will react with 
both the support and a functional group, such as a hydroxy! or amino group, on the 
polypeptide, l or example, the polypeptide may be bound to supports having an 
appropriate polymer coating using benzoquinone or by condensation of an aldehyde 

1 5 group on the support with an amine and an active hydrogen on the polypeptide {see. 
Pierce Immunotechnology Catalog and Handbook (1 991) at A12-A1 3). 

In certain embodiments, the assay is an enzyme linked immunosorbent 
assay (ELISA). This assay may be performed by first contacting a polypeptide antigen 
that has been immobilized on a solid support, commonly the well of a microtiter plate, 

20 with the sample, such that antibodies to the polypeptide within the sample are allowed 
to bind to the immobilized polypeptide. Unbound sample is then removed from the 
immobilized polypeptide and a detection reagent capable of binding to the immobilized 
antibody-polypeptide complex is added. The amount of detection reagent that remains 
bound to the solid support is then determined using a method appropriate for the 

25 specific detection reagent. 

Once the polypeptide is immobilized on the support, the remaining 
protein binding sites on the support arc typically blocked. Any suitable blocking agent 
known to those of ordinary skill in the art, such as bovine serum albumin or Tween 20 
™ (Sigma Chemical Co., St. Louis, MO). The immobilized polypeptide is then 

30 incubated with the sample, and antibody (if present in the sample) is allowed to bind to 
the antigen. The sample may he diluted with a suitable diluent, such as phosphate- 
buffered saline (PBS) prior to incubation. In general, an appropriate contact time (i.e.. 
incubation time) is that period of time that is sufficient to permit detect the presence of 
T. cruzi antibody within a T. C7*«z/-infected sample. Preferably, the contact time is 

35 sufficient to achieve a level of binding that is at least 95% of that achieved at 
equilibrium between bound and unbound antibody. Those of ordinary skill in the art 
will recognize that the time necessary to achieve equilibrium may be readily determined 
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by assaying the level of binding that occurs over a period of time. At room 
temperature, an incubation time of about 30 minutes is generally sufficient. 

Unbound sample may then be removed by washing the solid support 
with an appropriate buffer, such as PBS containing 0.1% Tween 20™. Detection 
5 reagent may then be added to the solid support. An appropriate detection reagent is any 
compound that binds to the immobilized antibody-polypeptide complex and that can be 
detected by any of a variety of means known to those in the art. Preferably, the 
detection reagent contains a binding agent (such as. for example, Protein A. Protein <j, 
immunoglobulin, lectin or free antigen) conjugated to a reporter group. Preferred 

10 reporter groups include enzymes (such as horseradish peroxidase), substrates, cofactors, 
inhibitors, dyes, radionuclides, luminescent groups, fluorescent groups and biotin. The 
conjugation of binding agent to reporter group may be achieved using standard methods 
known to those of ordinary skill in the art. Common binding agents may also be 
purchased conjugated to a variety of reporter groups from many sources Zymed 

1 5 Laboratories, San Francisco, CA and Pierce, Rockford, IL). 

The detection reagent is then incubated with the immobilized antibody- 
polypeptide complex for an amount of time sufficient to detect the bound antibody. An 
appropriate amount of time may generally be determined from the manufacturers 
instructions or by assaying the level of binding that occurs over a period of time. 

20 Unbound detection reagent is then removed and bound detection reagent is detected 
using the reporter group. The method employed for detecting the reporter group 
depends upon the nature of the reporter group. For radioactive groups, scintillation 
counting or autoradiographic methods are generally appropriate. Spectroscopic 
methods may be used to detect dyes, luminescent groups and fluorescent groups. 

25 Biotin may be detected using avidin, coupled to a different reporter group (commonly a 
radioactive or fluorescent group or an enzyme). Enzyme reporter groups may generally 
be detected by the addition of substrate (generally for a specific period of time), 
followed by spectroscopic or other analysis of the reaction products. 

To determine the presence ur absence of T. cruzi antibodies in the 

.10 sample, the signal detected from the reporter group that remains bound to the solid 
support is generally compared to a signal that corresponds to a predetermined cut-off 
value. This cut-off value is preferably the average mean signal obtained when the 
immobilized antigen is incubated with samples from an uninfected patient. In general, 
a sample generating a signal that is three standard deviations above the mean is 

35 considered positive for T. cruzi antibodies and T. cruzi infection. In an alternate 
preferred embodiment, the cut-off value is determined using a Receiver Operator 
Curve, according to the method of Sackett et al., Clinical Epidemiolog}*: A Basic 
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Science for Clinical Medicine, p. 106-7 (Little Brown and Co., 1985). Briefly, in this 
embodiment, the cut-off value may be determined from a plot of pairs of true positive 
rates (i.e. sensitivity) and false positive rates (100%-spccificity) that correspond to 
each possible cut-off value for the diagnostic test result. The cut-off value on the plot 
5 that is the closest to the upper left-hand corner (i.e.. the value that encloses the largest 
area) is the most accurate cut-off value, and a sample generating a signal that is higher 
than the cut-off value determined by this method may be considered positive. 
Alternatively, the cut-off value may be shifted to the left along the plot, to minimize the 
false positive rate, or to the right, to minimize the false negative rate. In general, a 
10 sample generating a signal that is higher than the cut-off value determined by this 
method is considered positive for T. cruzi infection. 

In a related embodiment, the assay is performed in a flow-through or 
strip test format, wherein the antigen is immobilized on a membrane such as 
nitrocellulose. In the flow-through test, antibodies within the sample bind to the 
15 immobilized polypeptide as the sample passes through the membrane. A detection 
reagent (e.g., protein A-colloidal gold) then binds to the antibody-polypeptide complex 
as the solution containing the detection reagent flows through the membrane. The 
detection of bound detection reagent may then be performed as described above. In the 
strip test format, one end of the membrane to which polypeptide is bound is immersed 
20 in a solution containing the sample. The sample migrates along the membrane through 
a region containing detection reagent and to the area of immobilized polypeptide. 
Concentration of detection reagent at the polypeptide indicates the presence of T. cruzi ' 
antibodies in the sample. Such tests can typically be performed with a very small 
amount (e.g., one drop) of patient serum or blood. 
25 The assays discussed above may be performed using one or more of the 

polypeptides described herein. Alternatively, the sensitivity may be improved by using 
epitopes of one or more additional T. cruzi antigens in combination with the above 
polypeptide(s). In particular, epitopes of TcD (disclosed, for example, in U.S. Patent 
No. 5,304,371), PEP-2 and/or TcE (both of which arc disclosed, for example, in U.S. 
30 Serial No. 08/403,379, filed March 14, 1995) may be used in conjunction with the 
above polypeptide(s). The PEP-2 antigenic epitope is also discussed in Peralta et al., ./. 
Clin. Microbiol. .32:971-74, 1994. The sequence of TcD is provided in SEQ ID NO:50, 
the sequence of TcE is provided in SEQ ID NO:51. The TcD antigenic epitope 
preferably has the amino acid sequence Ala Glu Pro Lys Ser Ala Glu Pro Lys Pro Ala 
35 Glu Pro Lys Ser (SEQ ID NO:53)or the amino acid sequence Ala Glu Pro Lys Pro Ala 
Glu Pro Lys Ser Ala Glu Pro Lys Pro (SEQ ID NO:54). The TcE epitope preferably 
has the amino acid sequence Lys Ala Ala lie Ala Pro Ala Lys Ala Ala Ala Ala Pro Ala 
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Lys Ala Ala Thr Ala Pro Ala (SllQ ID NO: 55) or the amino acid sequence Lys Ala Ala 
Ala Ala Pro Ala Lys Ala Ala Ala Ala Pro Ala Lys Aia Ala Ala Ala Pro Ala (SEQ ID 
NO:56), and the PEP2 epitope preferably has the amino acid sequence Gly Asp Lys Pro 
Ser Pro Phe Gly Gin Ala Ala Ala Gly Asp Lys Pro Ser Pro Phe Gly Gin Ala (SEQ ID 
5 NO:57). 

Additional epitopes may be present within the same polypeptide (i.e.. in 
a combination polypeptide) or may he included in separate polypeptides. Preferably, 
the polypeptides are immobilized by adsorption on a solid support such as a well of a 
microtiter plate or a membrane, as described above, such that a roughly similar amount 

10 of each polypeptide contacts the support, and such that the total amount of polypeptide 
in contact with the support ranges from about 1 ng to about 1 0 fig. The remainder of 
the steps may generally be performed as described above. 

The polypeptides described above may also be used following diagnosis 
using one or more of the epitopes from TcD. TcE and/or PKP2. In this embodiment. 

15 the polypeptides of the present invention are used to confirm a diagnosis of T. cruzi 
infection based on a screen with TcD, TcE and/or PEP2. Diagnosis of T. cruzi 
infection using epitopes from TcD, TcE and/or PEP2 is described in U.S, Serial No. 
08/403,379, filed March 14. 1995. 

In yet another aspect of this invention, methods arc provided for 

20 detecting T. cruzi in a biological sample using monospecific antibodies (which may be 
polyclonal or monoclonal) to one or more epitopes, as described above. Antibodies to 
purified or synthesized polypeptides may be prepared by any of a variety of techniques 
known to those of ordinary skill in the art. See. e.g., Harlow and Lane. Antibodies: A 
Laboratory Manual, Cold Spring Harbor Laboratory. 1988. In one such technique, an 

25 immunogen comprising the antigenic polypeptide is initially injected into any of a wide 
variety of mammals (e.g., mice, rats, rabbits, sheep and goats). In this step, the 
polypeptides of this invention may serve as the immunogen without modification. 
Alternatively, particularly for relatively short polypeptides, a superior immune response 
may be elicited if the polypeptide is joined to a carrier protein, such as bovine scrum 

30 albumin or keyhole limpet hemocyanin. The immunogen is injected into the animal 
host, preferably according lo a predetermined schedule incorporating one or more 
booster immunizations, and the animals are bled periodically. Polyclonal antibodies 
specific for the polypeptide may then be purified from such antisera by, for example, 
affinity chromatography using the polypeptide coupled to a suitable solid support. 

35 Monoclonal antibodies specific for the antigenic polypeptide of interest 

may be prepared, for example, using ihe technique of Kohler and Milstein, Eur. J. 
Immunol (5:511-519, 1976, and improvements thereto. Brietly, these methods involve 
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the preparation of immortal cell lines capable <>r producing antibodies having the 
desired specificity (/.£., reactivity with the polypeptide of interest). Such cell lines may 
be produced, for example, from spleen cells obtained from an animal immunized as 
described above. The spleen cells are then immortalized by, for example, fusion with a 
5 myeloma cell fusion partner, preferably one that is syngeneic with the immunized 
animal. A variety of fusion techniques may be employed. For example, the spleen 
cells and myeloma cells may be combined with a nonionie detergent for a few minutes 
and then plated at low density on a selective medium that supports the growth of hybrid 
cells, but not myeloma cells. A preferred selection technique uses HAT (hypoxanthinc, 
10 aminopterin, thymidine) selection. After a sufficient time, usually about 1 to 2 weeks, 
colonies of hybrids are observed. Single colonies are selected and tested for binding 
activity against the polypeptide. Ilybridomas having high reactivity and specificity are 
preferred. 

Monoclonal antibodies may be isolated from the supematants of 

15 growing hybridoma colonies. In addition, various techniques may be employed to 
enhance the yield, such as injection of the hybridoma cell line into the peritoneal cavity 
of a suitable vertebrate host, such as a mouse. Monoclonal antibodies may then be 
harvested from the ascites fluid or the blood. Contaminants may be removed from the 
antibodies by conventional techniques, such as chromatography, gel filtration, 

20 precipitation, and extraction* 

Monospecific antibodies to epitopes of one or more of the polypeptides 
described herein may be used to detect T. cruzi infection in a biological sample using 
any of a variety of immunoassays, which may be direct or competitive. Suitable 
biological samples for use in this aspect of the present invention arc as described above. 

25 Briefly, in one direct assay format, a monospecific antibody mav be immobilized on a ** 
solid support (as described above) and contacted with the sample to be tested. After 
removal of the unbound sample, a second monospecific antibody, which has been 
labeled with a reporter group, may be added and used to detect bound antigen. In an 
exemplary competitive assay, the sample may be combined with the monoclonal or 

30 polyclonal antibody, which has been labeled with a suitable reporter group. The 
mixture of sample and antibody may then be combined with polypeptide antigen 
immobilized on a suitable solid support. Antibody that has not bound to an antigen in 
the sample is allowed to bind to the immobilized antigen, and the remainder of the 
sample and antibody is removed. The level of antibody bound to the solid support is 

35 inversely related to the level of antigen in the sample. Thus, a lower level of antibody 
bound to the solid support indicates the presence of T. cruzi in the sample. To 
determine the presence or absence of T. cruzi infection, the signal detected from the 



\ 
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reporter group that remains bound to the solid support is generally compared to a signal 
that corresponds to a predetermined cut-off value. Such cut-off values may generally 
be determined as described above. Any of the reporter groups discussed above in the 
context of ELISAs may be used to label the monospecific antibodies, and binding may 
5 be detected by any of a variety of techniques appropriate for the reporter group 
employed. Other formats for using monospecific antibodies to detect T. cmzi in a 
sample will be apparent to those of ordinary skill in the art, and the above formats is 
provided solely for exemplary purposes. 

In another aspect of this invention, vaccines and pharmaceutical 

10 compositions are provided for the prevention of T vruzi infection, and complications 
thereof, in a mammal. The pharmaceutical compositions generally comprise one or 
more polypeptides, containing one or more epitopes of 7'. vruzi proteins, and a 
physiologically acceptable carrier. The vaccines comprise one or more of the above 
polypeptides and an adjuvant, for enhancement of the immune response. 

15 Routes and frequency of administration and polypeptide doses will vary 

from individual to individual and may parallel those currently being used in 
immunization against other protozoan infections. In general, the pharmaceutical 
compositions and vaccines may be administered by injection (e.g.* intramuscular, 
intravenous or subcutaneous), intranasally by aspiration) or orally. Between 1 

20 and 4 doses may be administered for a 2-6 week period. Preferably, two doses are 
administered^ with the second dose 2-4 weeks later than the first. A suitable dose is an 
amount of polypeptide that is effective to raise antibodies in a treated mammal that arc 
sufficient to protect the mammal from f. cruzi infection for a period of time. In 
general, the amount of polypeptide present in a dose ranges from about 1 pg to about 

25 100 mg per kg of host, typically from about 10 pg to about I mg. and preferably from 
about 100 pg to about 1 ng. Suitable dose sizes will vary with the size of the animal, 
but will typically range from about 0.01 mL to about 5 ml for 1 0-60 kg animal . 

While any suitable carrier known to those of ordinary skill in the art may 
be employed in the pharmaceutical compositions of this invention, the type of carrier 

30 will vary depending on the mode of administration. For parenteral administration, such 
as subcutaneous injection, the carrier preferably comprises water, saline, alcohol, a fat, 
a wax or a buffer. For oral administration, any of the above carriers or a solid carrier, 
such as mannitol, lactose, starch, magnesium stcaratc, sodium saccharine, talcum, 
cellulose, glucose, sucrose, and magnesium carbonate, may be employed. 

35 Biodegradable microspheres {e.g., polylactic galactide) may also be employed as 
carriers for the pharmaceutical compositions of this invention. 
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Any of a variety of adjuvants may be employed in the vaccines of this 
invention to nonspecifically enhance the immune response. Most adjuvants coniain a 
substance designed to protect the antigen from rapid catabolisiru such as aluminum 
hydroxide or mineral oil, and a nonspecific stimulator of immune response, such as 
5 lipid A, Horde I la pertussis or Mycobacterium tuberculosis. Such adjuvants arc 
commercially available as, for example, Freund's Incomplete Adjuvant and Complete 
Adjuvant (Difco Laboratories, Detroit, MI) and Merck Adjuvant 65 (Merck and 
Company, Inc., Rahway. NJ). 

0 The following Examples are offered by way or illustration and not by 

way of limitation. 
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EXAMPLES 



Example 1 

5 Preparation of DNA Encoding 7: cruzi Antigens 

This Example illustrates the preparation of genomic and cDNA 
molecules encoding 7. cruzi Antigens. 



A. Preparation of Genomic Clones 

1 0 A genomic expression library was constructed from randomly sheared T. 

cruzi genomic DNA (Tulahuen C2 strain) using the Lambda ZAP expression system 
(Stratagene, La Jolla, CA) according to the manufacturer's instructions. In one screen, 
the library' was screened with a pool of sera from five patients that displayed high 
reactivity with parasite lysate and/or one or both of the T. cruzi antigens TcD and TcE, 

15 described in U.S. Patent No. 5.304,371 and U.S. Serial No. 08/403,379, filed March 14, 
1995. Each of the five patients* sera was determined to be reactive based on Western 
and ELISA assays with whole lysate and/or TcD or TcE. Anti-/;. coli reactivity was 
removed from the serum prior to screening by adsorption. 50.000 pfu of the 
unampliticd library was screened with the scrum pool and plaques expressing proteins 

20 that reacted with the serum were detected using protein A-horseradish peroxidase (with 
the ABTS substrate). A subsequent screen was then performed with a pool of sera from 
three patients lacking detectable anti -TcD antibody in Western and ELISA assays using 
recombinant TcD. 

A similar screen was performed using a pool of sera that displayed low 

25 reactivity with lysate, TcD and TcE (i.e.. delected a signal less than 3 standard 
deviations over background reactivity in an ELISA or Western assay), followed by a 
subsequent screen with patient sera lacking detectable anti-TcD antibody, as described 
above. 

Twenty-eight clones that expressed proteins which reacted with both 
30 pools of sera in at least one of the above screens were then isolated. Excision of the 
pBSK(-) phagemid (Stratagene, Inc., LaJolla, CA) was carried out according to the 
manufacturer's protocol. Overlapping clones were generated by exo nuclease 111 
digestion and single-stranded templates were isolated after infection with VCSM 13 
helper phage. The DNA was sequenced by the dideoxy chain termination method or by 
35 the Taq di-terminator system, using an Applied Biosystem automated sequencer. Model 
373A. 
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Of the 28 clones, five had been reported previously, two were identical, 
and eight contained identical peptide sequences represented by a degenerate 42 base 
pair repeat. SEQ ID NO: 16 shows the prototype clone containing the 42 base pair 
repeat sequence. Accordingly, 14 novel DNA sequences encoding T. cruzi antigens 
5 were prepared using the above screen with the reactive pool of sera (shown in SEQ ID 
NO:l - SEQ ID NO:16 ? where SEQ ID NO:4 and SEQ ID NO:5 represent the 5* and .V 
ends, respectively, of a single clone, SEQ ID NO:9 and SEQ ID NO: 10 represent the 5" 
and 3* ends, respectively, of a single clone. One novel sequence was obtained with the 
screen employing the sera with low reactivity (shown in SEQ ID NO: 17 (5' end) and 
1 0 SEQ ID NO: 1 8 (3 * end)), 

B, Preparation of cDN A Clones 

Poly A+ RNA was purified from the intracellular amastigote stage of T 
cruzi (Tulahuen C2 strain). The RNA was reverse transcribed and used in the 

15 construction of a unidirectional cDNA expression library in the Lambda UniZap 
expression vector (Stralagene, La Jolla, CA) according to the manufacturer's 
instructions. 50.000 pfu of the unamplifled library was screened with a serum pool 
containing patient sera that displayed both high and low serological reactivity, followed 
by a subsequent screen with patient sera lacking detectable anti-TcD antibody, as 

20 described above. A total of 32 clones were isolated from this screen. Twenty-five of 
these cJoncs were proteins of the translalional apparatus that have been previously 
identified as highly immunogenic, and all were different from the clones identified by- 
screening the genomic expression library. The remaining seven are represented by the 
sequences provided in SEQ ID NO: 19 - SEQ ID NO:22. The sequence recited in SEQ 

25 ID NO:22 is that of T. cruzi ubiquilin. 

Example 2 
Synthesis of Synthetic Polvcntides 
This Example illustrates the synthesis of polypeptides having sequences 
30 derived from T. cruzi antigens described herein. 

Polypeptides may be synthesized on a Millipore 9050 peptide 
synthesizer using FMOC chemistry with HBTIJ (0-bcnzotriazole-RN,\\NT- 
telramethyuronium hexafluorophosphate) activation. A gly-cys-gly sequence may be 
attached to the amino or carboxyl terminus of the peptide to provide a method of 
35 conjugation or labeling of the peptide. Cleavage of the peptides from the solid support 
may be carried out using the following cleavage mixture: trifluoroacetie 
acid:ethanediol:thioanisoie:waier:phenol (40:1:2:2:3). After cleaving for 2 hours, the 
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peptides may be precipitated in cold methyl-t-bulyl-clher. The peptide pellets may then 
be dissolved in water containing 0.1% trifluoroacetic acid (TFA) and lyophilized prior 
to purification by C18 reverse phase HPLC. A gradient of ()%-60% acetonkrile 
(containing 0.1% TFA) in water (containing O.I%TFA) may be used to ehite the 
5 peptides. Following lyophilization of the pure fractions, the peptides are characterized 
using electrospray mass spectrometry and by amino acid analysis. 

This procedure was used to synthesize peptides such as Tce22-l.Tcc22- 
1+, Tcc22-2,l (contained within SEQ ID NO:41), TcLol.lJ.2 and 1.3 (contained 
within SEQ ID NOs 34, 35 and 36) and TcHilO.l and 10.3 (SEQ ID NOs 26 and 27) 
1 0 which have the following sequences: 

Tcc22-1 VRASNCRKKACGHCSNLR.V1KKK 
TCC22-1+ EALAKKYKVVEKKVCRRCYARLPVRASNCRXKACXiHrSNLRMIvKK 

15 

Tcc22-2.1 VLRL RGG VM EPTLEA L AKK Y N WRK K VC RRC Y A RL 
TcLol.I U Y V RGRKQR WQLH ACG Y V RG R KQRRQ L> 1 ACG Y V RG RKQR WQt . ( I A P 
20 TcLol.2 GTSEEGSRGGSSMPSGTSEEGSRGGSSMPA 
TcLol .3 VRPRKRAFVA APCLRVRPRKEAEEAAPCLR 
TcHilO.l SVPGKRLRNSHGKSLRNVHGKRPKNEHGKRLRSVPNERLR 

25 

TcIIi 1 0.3 EAEELARQESEERARQEAEERAWQEAEERAQRKAEERAQR 

Example 3 

Serological Reactivity of T. cruzi Recombinant Antigens 
30 This example illustrates the diagnostic properties of several recombinant 

antigens found to be serologically active. This includes studies of reactivity with 7~. 
cruzi positive and negative sera as well as cross reactivity studies with sera from 
patients with other diseases. 

Assays were performed in 96 well plates (Corning Easiwash, Corning, 
35 New York). Wells were coated in 50|il of carbonate coating buffer pH 9.6. For T.cruzi 
lysatc, lOOng/well was used, and for each of the recombinant antigens 200ng/well was 
used. The wells were coated overnight at 4°C (or 2 hours at 37°C). The plate contents 
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were then removed and wells were blocked for 2 hours with 200ul of PBS/I %BSA, 
After the blocking step, the wells were wushed five times with PBS/0.1% Twecn 20™. 
50^1 of sera (cither positive or negative for T. cruzi infection), diluted 1:50 in 
PBS/0.1% Tween 20/0J%BSA was then added to each well and incubated for 30 
5 minutes al room temperature. The plates were then washed again five times with 
PBS/0. 1 % Tween 20™. 

The enzyme conjugate (horse radish peroxidase-Protein A, Zymed. San 
Francisco. CA) was then diluted 1:20.000 in PBS/0.1% Tween 20™/0.1%BSA, and 
50mI of the diluted conjugate was added to each well and incubated for 30 minutes at 

10 room temperature. Following incubation the wells were again washed five times with 
PBS/0.1% Twecn 20™. 100^1 of the peroxidase substrate, tetramethylbenzidine 
(Kirkegaard and Perry Laboratories. Oaithersburg, MD) was added, undiluted, to each 
of the wells and incubated for 15 minutes. I*he reaction was stopped by the addition of 
lOOpl of IN H 2 SOj to each well, and the plates were read at 450nm. 

!5 Figure 1 shows the reactivity of the recombinant rTcc6 (SEQ ID NO:39) 

as compared to that of T. cruzi lysate. Based on a cutoff of the mean of the negatives 
plus 3 standard deviations, 49 out of 50 serum samples were positive with lysate. and 
34 out of 50 were positive with rTcc6. In a similar study (shown in Figure 2\ the 
recombinant rTcc22 (SEQ ID NO:41) was found to have a sensitivity of 79.2% (38 out 

20 of 48 serum samples were positive). Comparative studies of the recombinant rTcc3K 
(SEQ ID NO:38) with T. cruzi lysate using similar criteria showed that 24/39 were 
positive compared with 39/39 for lysate (Figure 3). Tcc38 when tested with potentially 
cross reacting sera showed improved specificity over T. cruzi lysate. 

The recombinant TcHil2 (SEQ ID NO:37) was also found to be 
25 immunoreactive (Figure 2) having a sensitivity of 62.5% ( 1 5/24), 

Example 4 

Serological Reactivity of T. cruzi Synthetic Peptide Amip ens 
This example illustrates the diagnostic properties of several of the 
30 peptides described in Example 2. These peptides were tested for reactivity with T. cruzi 
positive and negative sera and, in some cases, for cross reactivity with sera from 
patients with other, potentially cross reactive, diseases. 

The first group of peptides included different reading frames to 
determine the most reactive repeat sequence. The peptides tested were TcLol.l 
35 (contained within SEQ ID NO:34), TcLol.2 (contained within SEQ ID NO:35) and 
TcLol.3 (contained within SEQ ID NO:36). representing reading frames 1, 2 and 3 of 
the DNA sequence provided in SEQ ID NO: 18, and TcHilO.l (SEQ ID NO:26) and 
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TcHi10.3 (SEQ ID NO:27) which represent two of the reading frames for the TeHilO 
sequence (shown in SEQ ID NO:5). The data is shown in Figure 4. In the case of the 
TcLo frames, both the TcLol.l and 1.2 peptides were strongly reactive but the TcLol. 2 
was superior in signal to noise when tested on sera from T. cruzi positive and negative 
5 individuals. TcLol.3 had lower signal but also low background. In this study lysale 
detected 24/24 positives, TcLol.l detected 21/24 f TcLol. 2 detected 23/24 and TcLol. 3 
detected 15/24. In the same study, the two frames TcHilO.l and 10.3 detected 19/24 
and 14/24 positives respectively, but with lower signal than for TcLol. Cross reactivity 
studies with these different reading frames demonstrate that TcLol .2 has minimal cross 

10 reactivity with the sera tested (Figure 5) as compared to T.cruzi lysate. 

As discussed in Example 2. overlapping peptides were also synthesized 
for rTcc22 to determine the active epitope. The peptides Tcc22-1 , H and 2 were tested 
with T. cruzi positive and negative sera The results are shown in Figure 2. The Tcc22- 
1+ and Tcc22-2.1 peptides were more reactive than the Tcc22-I peptide. In the first 

15 experiment, Tcc22-1 and Tcc22-1+ detected 29/48 and 36/48 positives as compared to 
the recombinant Tcc22 which detected 38/48 positives, In a subsequent experiment, 
Tcc22-2. 1 was also shown to be reactive but with less signal than Tcc22-1+ at the same 
plate coating level. 

A polypeptide having the TcHiJ5 frame 3 repeat sequence (SEQ ID 

20 NO:49) was also synthesized and tested in an ELISA assay using a coating level of 200 
ng/well. A total of 48 T. cruzi positive sera and 26 negative sera were tested in order to 
determine the reactivity of this peptide sequence. In this study, the peptide had a 
sensitivity of 68.75% (detecting 33 out of 48 positives) and a specificity of 92.3% (24 
out of 36 negatives), indicating that this polypeptide has potential significance in 

25 delecting T. cruzi infections. The results of this assay are prefiented in Tabic 3 , below. 



Table 1 

Reactivity of TcIIilS Frame 3 Polypeptide with T. cmrr-Positive and Negative Sera 



Sample ID 


7*. cruzi Status 


OD 450 


Sample ID 


T. cruzi Status 


OD450 


IcOll 095-1 


Positive 


0.696 


DL4-0I06 


Negative 


0.167 


TcO 11095-2 


Positive 


0.699 


DL4-0N2 


Negative 


0.05 


TcOl 1095-3 


Positive 


1.991 


DL4-0127 


Negative 


0.240 


TcO 1 1095-4 


Positive 


3 


DL4-0140 


Negative 


o.oos 


TcOl 1095-5 


Positive 


0.098 


DL4-0I45 


Negative 


0.107 


TcO 11095-6 


Positive 


0.238 


DL4-0161 


Negative 


0.M9 


TcOl 1095-7 


Positive 


0.1 15 


DM-0162 


Negative 


1.187 


TcOl 1095-8 


Positive 


0.156 


DL4-0I66 


Negative 


0.210 


TcOl 1095-9 


Positive 


0.757 


DU-0167 


Negative 


0.131 


TcO 1 1 095- 1 0 


Positive 


U47 


DL4-OI72 


Negative 


0.073 
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Sample ID 


T. cruzi Status 


OD450 


Sample ID 


7'. cruzi Status 


OD 450 


TcOl 1095- ! 1 


Positive 


0.264 


DL4-0I75 


Negative 


0.1 17 


TcOJ 1095-12 


Positive 


1.7 


DL4-0176 


Negative 


0.815 


TcOl 1095- J 3 


Positive 


1.293 


AT4-0013 


Negative 


0.100 


TcOl 1095-14 


Positive 


0.242 


AT4-0041 


Negative 


0.107 


TcO 11095-15 


Positive 


0.636 


AT4-0062 


Negative 


0.28 


TcOl 1095-16 


Positive 


0.44 


AT4-0063 


Negative 


0.155 


TcOl 1095-17 


Positive 


3 


R4-005 1 


Negative 


0,162 


TcOl 1095-18 


Positive 


1.651 


F.4-0059 


Negative 


0 176 

\r r I/O 


TcOl 1095-19 


Positive 


0.19 


E4-0068 


Neirarive 


0 24 1 


TcOl) 095-20 


Positive 


0.916 


E4-007 1 




Li. \ J. f 


TcOl 1095-21 


Positive 


0.715 


("4-0072 




o i n i 


TcOJ 1095-22 


Positive 


1.336 


C4-0088 




V. 1 ^ 1 


TcOl 1095-23 


Positive 


1.037 


C4-0090 




O 07R 


TcOl 1095-24 


Positive 


0.332 


C4-0096 


Negative 


0162 


TcOl 1095-25 


Positive 


0.4 J 3 


C4-OI0I 


Negative 


0. 1 S ] 


TcOl 1095-26 


Positive 


0.266 


C4-0105 


Neeaiive 


0 7fP 


TcOl 1095-27 


Positive 


1.808 








TcOl 1095-28 


Positive 


0.238 








TcOl 1095-29 


Positive 


0.266 








TcOl 1095-30 


Positive 


1.563 








TcOl 1095-3) 


Positive 


0.352 


Sens it i vitv 


33/48 


DO. : J> t U 


TcOl 1095-32 


Positive 


0.208 


Specificity 


24/26 


07 10°.'<. 


TcOl 1095-33 


Positive 


0.6S6 


Mean Pos. 


0.9188 




TcO 1 1 095-34 


Positive 


1.281 


Std Dev Pos . 


0.79 




TcOl 1095-35 


Positive 


0,907 


Mean Neg, 


0.1508 




TcOl 1095-36 


Positive 


0.429 


Std Dev Neg. 


0.06695 




TcO 1 1095-37 


Positive 


0,454 






TcOl 1095-38 


Positive 


0.725 








TcOl 1095-39 


Positive 


0.703 








Tc0394-7 


Positive 


0,186 








Tc0394-8 


Positive 


1.06 








Te0394-9 


Positive 


1.813 








Tc0394-I0 


Positive 


0.131 








Tc0394-I 1 


Positive 


1.631 








Tc0394-12 


Positive 


0.613 








Tc0394-13 


Positive 


3 








Tc0394-I4 


Positive 


0.268 








Tc0394-I5 


Positive 


2 211 


i 







Example 5 

Serological Reactivity of Peptide Combinations 
This example illustrates the diagnostic properties of several peptide 

5 combinations. 

The TcLol.2 peptide (contained within SEQ ID NO:35) was tested in 
combination with the synthetic peptide TcD and also the dual epitope peptides D/2 
(which contains ihe TcD and the PEP-2 sequences) and D/E (which contains TcD and 
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TcE sequences). These combinations were compared with the individual peptides as 
well as the tripeptide 2/D/E, which contains TcD, TcE and PEP-2. The TcD .sequence 
used was Ala Glu Pro Lys Ser Ala Glu Pro Lys Pro Ala GIu Pro Lys Ser (SEQ ID 
NO:53L the TcE sequence was Lys Ala Ala Fie Ala Pro Ala Lys Ala Ala Ala Ala Pro 
5 Ala Lys AJa Ala Thr Ala Pro Ala (SEQ ID NO: 55), and the PLP2 sequence was Gly 
Asp Lys Pro Ser Pro Phe Gly Gin Ala Ala Ala Gly Asp Lys Pro Ser Pro Phe Gly Gin 
Ala (SHQ ID NO: 57), 

The data are shown in Figure 6. The results show that TcLoL2 can 
augment the reactivity of TcD, D/2 and D/E, as summarized in Table 2. 

10 

Table 2 

Sensitivity of Peptide Combinations in the Detection of 7. cruzi Infection 



Peptides j Number of Positives 


TcD 


62/67 


TcE 


50/67 


PEP-2 


66/67 


TcLol.2 


61/67 


TcD+TcLol.2 


66/67 


D/2+TcLoI.2 


67/67 


D/E+Tcl.ol.2 


67/67 


2/D/E 


67/67 



15 These results demonstrate the use of T. cruzi antigens as described 

herein to enhance the serodiagnostic properties of other antigens. 

Example 6 

Serological Reactivity oPTcE Repeal Sequences 
20 This example illustrates the diagnostic properties of several TcE repeat 

sequences. 

The repeal sequence region of the recombinant TcE contains several 
degeneracies, resulting in residues where an A (alanine), T (threonine) or I (Lsoleucine) 
can be present in the repeat sequence. In order to represent all degeneracies, the 
25 original sequence for the synthetic TcE peptide was made with an A, T and I in a single 
peptide containing three repeats (see Example 5). In order to further epitope map the 
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)0 



repeat region and to determine the number of repeats required for serological activity, 
the following peptides were prepared as described in Example 2: 

original TcE K A Al APAKA A A AP AKA ATAPA (SEQ ID NO:55) 

TcE(3A) KAAAAPAKAAAAPAKAAAAPA (SEQ ID NO:58) 

TcE(3T) KA ATAPAKAATAPA K AATAP A (SEQ ID NO:59) 

TcE(3I) K A A I AP AKA Al AP A KA A 1 APA (SEQ ID NO:60) 

TcE(2A) KAAAAPAKAAAAPA (SEQ ID NO;61 ) 

TcH(AT) KAAAAPAKAATAPA (SEQ ID NO:62) 



The serological reactivity of these peptides was then compared. A total 
of 24 positive and 21 negative sera were tested with each of the TcE variants as the 
solid phase in an ELISA assay performed as described in Example 3. using 25 ng/well 
of peptide. The reactivity of the different peptides is shown in Figure 7. The highest 

1 5 reactivity was seen with the 3-rcpcat peptide in which each repeat contained an A at the 
degenerate residue (TcE(3A)). This peptide displayed even higher reactivity than the 
original TcE sequence containing an A, T and I residue in the three repeats. The 31 and 
3T variants by contrast were essentially negative with the T. cruzi positive samples 
tested. The sequence containing two repeats with A (TcE(2A)) was clearly less reactive 

20 than the 3 A sequence and the two repeat sequence with an A and a T (TcE(AT)) was 
negative. Based on a cutoff of the mean of the negatives plus three standard deviations, 
the original TcE (A,T,I) detected 17 out of 24 positives and the 3 A variant detected 19 
out of 24 positives. It also appears that to obtain maximal serological activity al least 
three repeats are required. 

25 

Example 7 

Serological Reactivity of Multi-epitope Peptide Combinations 
This example illustrates the diagnostic properties of several multi- 
epitope peptide combinations. 

30 Two dipeptides PEP-2/TcLol.2, which contains the PEP-2 (SEQ ID 

NO:57) and TcLoI.2 (SEQ ID NO:35) sequences, and TcD/TcE, which contains the 
TcD (SEQ ID NO:53) and TcE (SEQ ID NO:55) sequences, were synthesized as 
described above in Example 2. The reactivity of these two dipeptides with 7. cruzi 
antibody-positive sera was compared to that of the tripeptidc 2/D/E. ELISA s were 

35 performed as described in Example 3 using PEP-2/TcLol .2 at 250ng/well and TcD/TcE 
at 50ng/well. The results of this study are shown in Figure 8. One T. cruzi positive 
serum found not k> react with the tripeptidc 2/D/E was used in screening for the 
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TcLol.2 epitope. This serum was detected by the TcLol.2 epitope and also by the 
dipeptide mix (PEP-2/TcLo!.2 together with TcD/TcE) as expected. 

A tetrapeptide containing the four immunorcactive T cruzi epitopes 
PEP-2. TcD, TcE and TcLol.2 in a linear sequence, herein after referred to as 
5 2/Lo/2E/D (SEQ ID NO:63) was synthesized as described in Example 2 This 
tetrapeptide was coated at lOOng/well and its reactivity with T. cruzi positive and 
negative sera was assayed as described in Example 3. The reactivity of the tetrapeptide 
2/Lo/2E/D is shown in Figure 8, The one T. cruzi positive serum found not to react 
with the tripeptide 2/D/E was detected by the tetrapeptide as expected. 

10 The four tmmunoreactive T cruzi epitopes PEP-2, TcD. TcE and 

TcLoL2 may also be linked into one reagent by the use of a branched' peptide 
originating from a lysine core residue. Orthogonal protection of the lysine, for example 
employing 9-FluorenyImethoxycarbonyl (Fmoc) on the a-amino group and l-<4,4- 
Dimethyl-2,6-dioxocycloheX'l-ylidene)cthyl (Ddc) on the e-amino group, is used to 

15 permit selective deprotection of one amino group in the presence of the other, thereby 
allowing the synthesis of the first peptide chain from either the a- or €- group on the 
lysine. This first peptide chain is terminated with a protecting group that is not 
removed during the course of the synthesis of the second peptide chain. For example, a 
tert-Butoxy carbonyl (Boc) amino acid could be used with the Ddc and Fmoc 

20 combination. The remaining lysine amino protecting group is then removed before a 
second amino acid chain is synthesized from the second amino moiety. For example, f> 
Ode is removed with 20% hydrazine. Cleavage of the branched peptide from a solid 
support and removal of the N-a-Boc moiety is carried out using trifluoroacetic acid, 
following standard protocols. Using this approach two independent amino acid 

25 sequences can be built from a 'core' lysine residue, as shown below, thus allowing 
various combinations of TcD, TcE. PEP2, TcLol.2, and other epitopes to be coupled to 
the core residue. Purification of the resulting peptide is performed as described in 
Example 2. 

30 ^ 2/D/E 



Lysine 




TcLol.2 

35 
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Example 8 

Comparison of the Serological Reactivity of TcHi29 and TcE 

The antigen TcHi29 (SEQ ID NO:52) was shown to be a polymorph of 
5 the TcF repeat sequence. A TcHi29 peptide was synthesized that had the following 
sequence as compared to TcE. 

TcE KA AI AP AKA A A A PA K AATAPA (SEQ ID NO; 55) 

{0 TcHi29 KTAAPPAKTAAPPAKTAAPPA (SEQ II) NO: 64) 

Hgure 9 shows a comparison of the reactivity of these two related 
seqences with sera from T. cruzi positive patients as well as from other disease 
categories, as determined by liLISA using the procedure described above. The data 

15 indicate little or no cross reactivity with the other disease groups tested but the 
distribution of reactivity amongst the T. cruzi positive sera partially overlapped for the 
two peptides. Of the 53 consensus positive samples tested, TcE detected 31/53 and 
TcHi 29 36/53. Within this group Tel: and TcHi29 both detected 24 of the same sera. 
TcE detected 7 positive sera not detected by TcHi29, which in turn detected 12 positive 

20 sera missed by TcE. A dipeptide, TcD/TcHi29, was also synthesized and used in 
combination with the PEP-2/TcLol .2 dipeptide in ELISA {100ng/weil TcD/TcHi29, 
250ng/wc]| P£P-2/TcLo].2) and compared with the TcD/TcE plus PEP-2/TcI.ol.2 
dipeptide combination. As shown in Figure 10. the data indicates that the overall 
activity of the two mixes are similar for both the T. cruzi positive and negative 

25 populations studied and suggests that, in such peptide combinations, TcUi29 can be 
considered to be an alternative to TcE. 

From the foregoing, it will be appreciated that, although specific 
embodiments of the invention have been described herein for the purpose of 
30 illustration, various modifications may be made without deviating from the spirit and 
scope of the invention. 
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SEQUENCE LISTING 



(1) GFNFRAI INFORMATION: 

C) APPLICANT: Corlxa Corporation 

(ii) TITLE OF INVENTION: COMPOUNDS AND MFTH03S FOR TKF. DETECTION AND 
PRPi/rNTION OF T. CRUZ J INFECTION 

(iii) NUMBER OF SEQUENCES: 65 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE : SEED and BERRY IIP 

(B) STREET: 6300 Columbia Center. 701 Fifth Avenue 

(C) CITY: Seat-Ze 

(ID) STATE: Washington 

(E) COUNTRY: USA 

(F) ZIP: 98104-7092 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible? 

(C) OPERATING SYSTl M: PC -DOS/MS -DOS 

(D) SOFTWARE : =>atentln Release #1.0. Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 14-N0V-1995 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: HaSci . David J. 

(B) REGISTRATION NUMBER: 31.392 

(C) REFERENCE/DOCKET NUMBER: 210121. 422PC 
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(ix) TFI FCOMMUNJCATION INFORMATION: 

(A) TELEPHONE: (206) 622-4900 

(B) TELEFAX : (206) 682 6031 



(2) INFORMATION TOR SbQ ID NO:l: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 518 base Dairs 

(B) T V PE: nucleic acid 

(C) S7RANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SLQ ID NO:l: 

CGGGAAAAGA AG6CTG1TAC GACGCACGAG CTTGGCTTTG AGGGCGAGGA CTG3GACTAC 60 

GTGCTGGAGC GGCGGCGCGC GGAGGTGAAG GACGTGCTGG CCGTCGAGAC GGCGCGGGCG ".20 

TTGGGACTCG AGCGTGAGGA CGTGCTGGAG GTGGAGGTCG ACGCAGTGCC TCGGAGCCTC ISO 

ATTGCGTTTG TCACGGTCCG TCATCCATCA CTGCTGAGCG ACCGCAGGTG GAAGAGACGC 240 

TGGCGCGCIG CGAGTACAGG AAATTGTGGG CGCTGTACGA GACGCGGCCA CTGGAGTCGT 300 

CAG:GCTGAT GAGGCGGm GAGGGCGACG ACTGGGACCT CGTGGTI'GAC AACAACCGCA 360 

GGAAGCTTGA GGACGCGTTC AGCAG3GAGA CGGCCGCGCA CTGGGCGTGT CGCCGAGGCA 420 

GGTT3TGCTT CTGGACTGCA GGGTTGGCAG CCTTCTCATG GTATTCAAGG TGCTTGGATG 480 

CGCCATGAGC GACGCAGAGA TCACGGAACG GACCGAGG -IB 

(2) INFORMATION FOR SEQ 10 NO: 2: 

(l) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 560 base pairs 
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(B) TYPF: nucle J c acid 

(C) S'IRANDEDNESS: single 
{!)} IOPOIOGY: l:nefi- 

(xi) SEQUENCE DESCRIPTION: ST.Q IR N0:2: 

CGGCGGTAGT CTGCGATGCT GTGGACCGAC GCAT7GAAAT ACACACCG'iC TTCGGCGTTC 60 

CTTTTTTTA TATG f I I 1 1 1 TTTATTGAGA AGATGTCTTG TTTGTTGTTG TTTTTTTTCA 120 

GTTTTTA'IGA I ACGAGCAGT TTGTCCGACT GCATTCATGC AGTGATTGGT AAT TCTTTCT 180 

ATTCTTTGGA ATTATGGCGA TATTATTCTT GTCTTTTAAA ATTCTTACAA CCAATTGTGC 240 

CTTAGA.GTTT CCTGCTTAGT TGCTATTAAC ACAC7GTTAG GAACGC6AM CCATGCAGA1 330 

CTTCGTGAAG ACACTGACGG 6CAAGACGAT CQCGCTCGAG GTGGAGTCCA GCGACACCAT 360 

TGAGAACGTG AAGGCGAAGA TCCAGGACAA GGAGGGTATC CGCCGGACCA GCAGCGCCTG 420 

ATCTTCGC1G GCAAGCAGCT GGAGGACGGC C6CACGCTCG CAGAC1ACAA CAICCAGAAG 480 

GAG1CCACGC TGCACCTTGT GCTGCGCCTG CGCGGCGGCA TGCAGATCTT CGTSAA6ACA f>40 

CTGACGGGTA AAGACGATCG 560 

(2) INFORMATION FOR SEQ ID N0:3: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 436 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDLUNLSS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
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CGGCTCCCTC CTCTGCTTCC TTCCTCGGAC GTGCCC3AAG GCATGGAGCT GCCTCCTCTG 60 

CTTCCTTCCT CGGACATACC CGAAGGCATG GAGCTGCCAC CTCTGCTTCC TTCCTCGGAC 120 

GTACCCQCGG GCATGGAGCT GACACCTCTG CTTCCTTCCt CGGACGTGCC CGAAGGCATG 280 

GAGCTGCCAC CTCTGCTTCC TTCCTCGGAC GTACCCGCGG GCATGGAGCT GCCACCTCTG 240 

STTCCTTCCT CGGACGTACC CGCGGGCA16 GAGCTGCCTC CTCTGCTTCC TTCCTCGGAC 300 

GTACCCGCGG RCATAGAGCT GCCACCTCTG ATTTCCTNCC TCGGACG'IAC CCNCAGGNAT 360 

GGAGATGNCT CCTCIGNTTC CTGCCTCGGA CGTNCCCNAA GGNA7AGAGN TGCNCCTCIG 4?0 

NTTCCTNCCT CGGAAG 43 6 
(2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 373 base pairs 

(B) TYPE: nucteic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEOUENCt DESCRIPTION: SEQ ID NO:4: 

CCfCAGGGGC TCTTGGCGTT CCTTTTTTTC TTGTTGTTTT C^GTTTTTTT TTCTTTTGTT 60 

TTGGTTTGTC GTCTCTGTTT TTATGTGCGT 1GTTTTCGST TTrCTTTTT GTTCTTCCTG 120 

CCTG'CATGT GACTAGTTTT ATGTTTTCCA GGCCGACCGI CACTCAATTT TTTTATTTTT 180 

ATTTTTATT* ATTTATTTG.A CCCGCCTTTC 1CTG7AGTTT ACGAGAGTT AGATnTTAT 240 

TGATTGGTAG TTTAGGGCCA TCAGGCGGGA GGGGCGAGTC TGGCGGAAGA CAAAACAAAA 300 
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TACGATGGAC TCGACCA4CA GCATCGAGAA AICGCITCTG ATGGAGATGG AGC3GGAGGT 360 
IGAGAGGGCG AGG 373 
(2) INFORMATION FOR SEQ ID NO: 5: 

(O SFQUFNCF CHARACTERISTICS: 

(A) LENGTH: 560 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) 1 OTOLOGY: linear 

(x ; ) SEQUENCE DESCRIPTION: SEO ID NO: 5: 

CAGAAAAAGA ACGTAGATTT CCAACCAAAA CAGCAAGAGC GGATCCAACA ACGACCAAAC 60 

AACTCATAT TCGAGCTCTC CAAAATATAT CGCTIGCCTT CGGGATTGAA CCCTCATCTA t?0 

CAGTAAAATA CGCCGAAAGC ACGCAAGAAG AAAATGGAAA ACGTTCACAA AG 1 GAGGCCC 180 

AGGAGCGTGC ACGGCGG&AG GCTGAGGAAC GAGCACGGCG AGAGGCTGAG GAACGAGCCC 240 

AACGAGAGGC TGAGGAACGA GCCCAACGAG AGGCGAGGA ACGAGCACGG CGGGAGGCTG 300 

AC.AAGCGTGC CCGGCGAGAG GCTAAGGAAC GAGCATGGCA AGAGGCCGAA GAACGAGCCC 360 

AACGAGAGGC IGAGGAGCGT GCCCGGCGAG AGGCTGAGGA GCG'GCCCGG CGAGAGGTTG 420 

AGGAGCGTGC CCGGCAAGAG GCTGAGGAAC TCGCACGGCA AGAGTCTGAG GAACGTGCAC 480 

GGCAAGAGGC CGAAGAACGA GCATGGCAAG AGGCTGAGGA GCGTGCCCAA CGAGAGGCTG 540 

AGGAGCGTGC TCAACGAGCG 560 
(2) INFORMATION FOR SEQ ID N0:6: 
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.(■») SEQUENCE CHARAC T ERISTICS: 

(A) LtNGlH: 440 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

GCCTCCTCCA ACTCGAGCTG GCAGCGTSGA GGTGCNGCAG GAAC7CTCAA NAGAHGACQG 60 

CTCTCCCTCG ATANCNTTCG 6AGTGACTTK GACTGTTGCG CCNTTTCCGT NTCACTATTT 12C 

CTATTGCTTT TAATTTGCTG GAGAGGCGCG TGTAG3AGGS AAAGAGIAGT AACArGGCAG 180 

AAT CATC AAA AACCATGTTG CGI IAGTAGA GAGGAGGGAA ACATCGAGAC GTTGAGGGTT 240 

GCGACGGNCA AAATTATGTA CATTTACCTG AATTAGGATA. AGACTTCATA TG3CATAAAC 300 

TCGTGGCG 1 1 GTTGGTGGTT ATAACAA.GCA ACGGTGACGA TGTCTTAGGC TACAC'F GC TG 360 

CAC TCAAAGA GTTT TACAGG TACTTGCGGG ATATTTGTTC CTGTGAGTTT GT1 FTCTATT £20 

GTAATTTATT NNGTCTCAAT 440 
(2) INFORMATION FOR SFO ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1915 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDMSS : single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 7: 
CGATGCGTCT GTCGTAGACC TGGGAGGCGA GGCCCATGGG ACACACTATG CCT1TTTGCC 60 
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CGATGTGATC AA3GGGATTG CGCAGGAAGA GCTGTACCTG GAAGACGATG CGTACFTCA 120 

GGAG TIGCTT GCGAGGTATA AAGAACTTGT CCCTGTGGGT GCCGAGCCAA CCGAGCCACG 18:: 

CGCAAAGCAG TTGCGCGAGC AAATGCGGAT ACGGGCTGGG CAGCTTGCTG TTGACACCCG 240 

AAAGCTTCAT GCGGCCGAAG AGCGGGCTGC ATCGCGGATG GCGACACTTT ACCCGTTTGT 300 

GGGCTCGGCG CCGCTGGGAG TTGCTCTG'G GAAlAiCCCC GTGGAGGCGG ACGAAGAGTT 360 

CTGTGCACTT CTGCTGAAGC GCGAAGAAGC GCTGGCGGGG AAGTCAGGGT CCGTCCACGA 420 

AGTGGAATCT GCGCTGAGCG CGCGTGCGGA AGCGATGGCG AAGGCGGTGC TGGA3GAGGA 480 

GGAGGCGC 1 1 GCGGCGGCAT TTCCATTTCT GGGGCGGAGT GTTAAGGGAG CCCCTCTGCG 540 

TGAGTTGGU C I LATGTCTG ATCCCAAI 1 I 1GCGGAGCIG GCGACACGGC ACGCGCAGGA 600 

GGCGAC(TCG GGCGATGCGG CGGGTATTTT GCGCCTTGAG CA6GAGCTGC GTGACCAGGC 660 

ATGTCGCATA GCACGTGAGG TGCGAGTGGC TCGGCGGCTT GACGCCG1CG CAA I GAGGAC 720 

CTGCACGAGC GGTACCCGTT ICTICCCGAG GAGCCG3TGC GCGGCATTCT TCTTGGTGCT 7SZ 

GTGCGTCCGG TGCAGCAACC GGCGTTCCGC GAGCTTTCAA. ACAA.GTTGGA TGAGCAGCGC B4" 

CGGGACCCGA CACGCAACGC AGGCGCGATC CGCACGACGG AGGAGCAGAT GAC1GCGTTG 900 

GTGGTGCGAC TGGCTGAGGA GCGCGC6GAG GCGAUGGAGA GGGCGCATGA GCAGTACGCS 960 

TTfCTCCCAC GACGTGTGCT GGGCGTGCGC CTTG3TGACA TCTCGC FGCA GGAGGATGA1 1Q2Q 

GTGTTGTCAC AGCTGGCGCG GCGTCCTGTG CGGCAGCTAA GAAACTCCAA GACG6CGAT" 1080 

GACGCACACG CAACTGAAGA AGAGATGA1 A AGGCGCGCAG AGGAGCTGGC TCGCAACGTG 11.10 
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AAGCTTGTCG ACGCATACCG TGG6AATGGC AACGAGTACG TGCGTGCCTG CAACCCGTT'i 1200 

CTCGTGTACG AGGACCGCAA GTGCGTCCTC CTGAGTGAGC TGCCGCTTGC CGGTGGCGAC 1260 

GTGIACCAGG GC11G1 ICCG GGA7TATCTG ACTGCGCTGG AGGACGCCGA GGCAMTGCA 1320 

CCGCGGATCG CGGAGCTGGA GAATGCGCTT CGGTCCCGTG CGGATGAG~T GGCGCTGGAG 1380 

GTTTGCGAGA GGGACGCGCG GTTGT"GCAT TACTCATTCC TCTCGGCCCA GGATGTTCCT 1440 

GGTTGGTCTG AAGCACTGCT GCATGACGCG GAGTTTCAGC AGCTACGTGA GCGTTACGAG 1500 

GAACTGAGCA AGGATCCACA GGGGAACGCC GAGGCATTGC GTGAGCTTGA GGATGCAAIG 1560 

GAGGCTCGGA GCAGAGCCAT TGCGGAAGCG TTGCGGAC7G CAGAGCGACT AATCCACTGA 1620 

GCAGGCGAGG CTGAAGACGC CGTCACAGGC GGGGTCTGGC GTGTCCGCGG G mATCGAAT 1680 

GCATuGCAGC GAGCAIGCGG ATCTC3CGCA TGAAGGGGGA AGCACGGCTG GCGGCACCAT 1/40 

GAGGGGGGCA GAGTCTGTCT CCAAGAGCAG TGGGAAACAC TCTCAAGGTC GGTCTCGCA I ]800 

GCGTCTGTCG TAGACC7GGG AGGCGAGGCC CATGGGACAC ACTA^GCCTT TTTGCCCGAT 1860 

6TGATCAAGG GGATTGCGCA GGAAGAGCTG rACCTGGAAG ACGATGCGTA CTTCG 1S15 
(2) INFORMATION FOR SEQ ID NO:8: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SFQ ID N0:8: 
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T TACCAAGCT GAGATAGA7A AAAGGCTGCA GGAGCAGCTT GCCCCTGAGA GGATGAGGGC 60 

TCTTTCCGCA TTTCTTTCGG AGTGACTTTG ACTGTTGCGC CGTTTCCGTG TCAC'I A 1 1 I C 1?0 

TATrGCTTTT AATTTGCTGG AGA3GCGCGT GTAGGAGGGA AAGAGTAGTA ACATGGCAGA 180 

A1CATCAAAA ACGATGTTGC GT7AGTAGAG AGGAGGGAAA CATCGAGACG TTGAG3GTTG 240 

CGACGGNCAA AATTATGTAC ATTTACCTGA ATTAGGATM GACTTCATAT 9GCAIAAACT 300 

CGTGGCGTTG TTGGTGGTTA TAACAAGC.AA CGGTGACGAT GTCTTAGGC1 ACACTGCTGC 360 

ACTCAAAGAG Til IACAGGI ACTTGCGGAT ATTTGTTCC1 .100 

(2) INFORMATION FOR SFQ ID NO: 9: 

(i) StQJENCt CHARACTERISTICS: 
(A) LENGTH: 936 base pairs 
CB) TYPE: nucleic acid 

(C) STRANDEONESS : single 

(D) TOPOLOGY: linear 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

GCCTCCTGCA ACTCGTGCTG GCAGCGTTGA AGTTCGGCAG AAATCTCAAC AAACGCCTTC 60 

TGTCCCTCGG AAACCTTCCC GTTAAGAGAC ACAAGCAGTT CAATGAGCGA CATGGTCGCT 120 

TCGGACACGT CCAATGCTTr CATGGTTTGT TCCAGCCGCC GCTGAAAG'T ATCCACACAT 180 

GAGAACAACA AAGACAAATC TAAATCGGCG ICGCCGTGCT CATACACATC AAACGCCACC 240 

CTCTCGCCCA AAACAT r CAA AAAGTTCACC AAAAACTTTA CAAGCTTACT CAAATT3TCA 3U0 

CGAAGTGAGC TAACGGTAAT TTCTAAACT I CCATTTCTTG CGTCATCCCT AGCCTTCGCC 360 
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GCGACTACCT TCTCCTTCCA TAGCACTAGC TTCTCCTCCA CCAAACGAAT ACCGCTCTCC 420 

TTTTCTTTCA CAGCAACCTC ACATTCCCTT TCAATTTCAT TCAACCTA4T TGGA'l 1ATTT CQQ 

TCTTAAACGA CTTGCCGTGC CC1CCTCGGG CTGATGAAAG GCCTCGCCCA GCTGCGCACG 540 

CAGATrCACG GTGTCCGCCC CGTTCTGCTC CCGGAGAGCG GCCAGTTCCT CGGTGG'I TCG 600 

CTTCAGCTCG CGATGCACCT CCTCGCGCTG CTGCAAGGCC TCGTCCAGC T GCOCACGCAG 662 

ATTCACGGTG TCCGCCCCGC TCTGCTCCCC GAGAGCGGGC AGTTCCT"CGG I GGTTCGCT1 720 

CAGCTCGCGA TGCACCTCCT CGCGCTGCTG CAAGGCC T CG TCCAGCTGCG CACGCAGATT 780 

CACGGTGTCC GCCCCGCTCI GCTCCCGGAG AGCGGGCAGT TCCTCGGTGG TTCGCTTCAG 8*0 

CTCGCGACGC ACCTCCTCGC GCTGCTGGAA GGCCTCCCCC AGCTGCGCAC GCAGATTCAC 900 

GGTGICCGCC CCTCTCTGCT CCCGGAGGGC GGGCAG 936 
(2) INFORMATION FOR SEQ 10 NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 702 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SFG ID NO: 10: 

ACTTGAAAGA NTGACCCAAT AATNGGGTTC CTTATTGTGC CACCCCAAAT AAACCCGTAA GO 

CCAATTTGTG GCTGGGATGG ATCCCCCCAC NCTCTTTGAC NCATGTCAAG AGTANATGGG 120 

ACGTCAAAGI CACTTAGAGA GGGATTCATG GGTNCCATTG ATCACAAGAG CCTNCTGGAA 180 
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GACCCCCCTG AAGATAAOX AATGAGATT ATCCTCTGCA TA4GATCACA CGAGGCGG":A 

TTAGCAATTA TCTTCACAGA TrCTTTTTCT TGTGATGGTG GCTTGCGGTA GTTTGTCATC 300 

ATTGTTTTCT GAATGCAATG AAGCACACGA CTTGTAATAC GTTCTCCATG TCTTTCAATC 360 

GTTTCCAACG CCTCCAC.AAT GTC7GCAGGA TCCCCAGGAA GGTCAGCAGT CATCAGAAGC 4?0 

TCTTCACATG AACGCCGTAA ACTAGGATCA CGCTCAACAA GGC'I AGCAA T CGCATTTGCC ASO 

A11CICGGAI [GCACTTGCA AAACCACTCC GGAAGlMAt I ICCACGACT GACCTCTGTC 540 

ATAATGTTGA ACCTCTCCCT AAAGCCTTTA CCCGCCACGG CAAGCCACAI CTCAAGAGCT 600 

ATCATACCCA GGCTGTATTC ATCCACTTTA AAG1CG7AG T CTTCCCCTCG CTCTTGCTC* 660 

GGGGCACAGT ACAACACAGA ACCCAAGTTT CCTGTAGGAC CG 70? 
(2) INFORMATION FOR SFQ ID M0:li: 

(1) SEQUENCF CHARACTER I ST ICS: 

(A) LENGTH: 510 base pairs 

(B) TYPE: nucle-c acrd 

(C) STRANOEDNESS : single 

(D) IOPOLOGY: linear 

Ui) SEQUENCE DESCRIPTION: SE0 ID NO: 11: 

CGAGTATTCC TGTGGAAATT GATATTAGAA ACCAGGACTT TTCTTTTCTT GACCCG3CAC 60 

CGGAGGGCAT TCCTATTCAG GACATACATC TTATGGGAGA TTCTGCATTT GCCGCATCTG 120 

CGCGTGAGCG CATGAAACTG AAAAGAAATC CTGTTGCGAA TGCGAGCAAG ATCAGTGCCC 180 

TTGAGGAGGA GATGGATCAA CGTGCTCATG TATTGGCTAA GCAGGIGCGT GACAAAGAGC 240 
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GCACTTTCCT TGATCCAGAG CCTGAGGGTG TTCCACTTGA GTTGCTTi'CA TTAAATGAAA 300 

ATGAGGCCTC ACAGGAATTG GAGCGAGAGC T7CG1GCCCT AAATCGCAAA CCCCGGAAGG .360 

ATGCCAAAGC .AATAGTTGC! CTTGAAGA I G ATGTGCGTGA CGAACACACG TGCTTGCCAA 123 

GGAGCTAAAG GAAAATGAGC GGAACATCTT TGTTGGCTCC ACAGCCTGAG GGTG7GCCSG 480 

'1GTCTGAGCI GTCGTTGGAT TTAGACGAGC 510 
(?) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 320 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linea- 

(xi) SFQUFNCE DESCRIPTION: SEO ID N0:12: 

CGGTCGTGGC AGAGCCAAAG CCACCAACAG CAGGTGCCGA CGTGTGCGCG GCAGAGCCGA 60 

AGCCACCAGC AGCAGGCGCC GAAGTGGICG TGGCAGAGCC AAAGCCACCA GCAGCAGGTG 120 

CCGACGTGTG CGCGGCAGAG TCGAAGCCAC CAACAGCAGG TGCCGACG1G GTCGTGGCAG 180 

AGCC.AAAG I C ACCAGTAGTA GGNGCCGACG TGTGNGTGGC AGAGNCANAG NCACCAGTAG ?4C 

NAGGTGNCGA CGTNG1CGTC GNAGASNCGA NGTCACCAGC AGGAGGTGNC GACGTNTGNG 300 

NGGNAGAGGC GATGTCACCA 320 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 302 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : "linear 

(xi) SEQUENCE UtSCRlPIION: 5EQ ID N0:13: 

ATGCATCTCC CCCGTACATT ATTTTGCGGA AAATTGGATT TTTACGGGGA GGTGGGGTTC 60 

GATTGGGGTT GGTGTAATAT AGGTGGAGAT GGAGTGCAG I GGGAIAGGAT TAGAATGTAG \?0 

TTGGTGTAGT ACAGAGTTTA TATAGTATAG TGTTGA1G1 I A I IAIACAAT GAGGTAAGAG 180 

AATGGAGTGA GAAAGAGTAT GTTTGTTAG T TTGGTTGTTA ATGTTATGTA T'CATGTTAT ?4() 

CAGTATATGT TG'I A'IGTGIA TGGIGA1AGC GGTG3GTGTA GCTGTATGTG GTAGGTTAGA 300 

GT 301' 
(2) INFORMATION FOR SEQ ID NO:14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 298 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:14: 

CAGTTTCAAT TTCCTCTCCA CCTGATCCCG CTGTTGCAAA AGCG I CCTTG ATGTATCCTG 60 

CTCCTTTGCC GCTAGCGCCT CCOTGC1AA GCGCAG1ICC TCTfGCAGCC TCCCCTGCAC 120 

CCGTTCCGCC TCCATTAATC TCTTCTCCCC GATTGCTTCT TTGGCGCGTA AATCCTCCAG 180 

TTCCTTCTCT ATCAAAGTGT GCCTCCCATT CCTGATCCGC GACTCTTCAC AGGCTTCTTG 240 
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CTCCGCGTCA CGGAGACGCC TCTTGAGAGC CTCGT TCT1C TCTTCCAGGT CTTCTGGG 298 
(2) INFORMATION FOR SEO ID NO: 15: 

C ) SEUUrNCC CHARACTERISTICS: 

(A) I ENGTH: 2144 base pairs 

(B) TYPE: nucleic acid 

(C) 5TRANDEDNES5: single 
CD) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:I5: 

CGCGGAATTC TTACCAAGCT GAGATAGATA AAAGGCTGCA GGAGCAGCTT GCCCCTGAGA 60 

GGATGAAGGC TCTTTCCACA TTTCTTTGGG AGTGACCTTG ACTGTTGCGC CGTTTCCGTG 120 

TCACTATTTC TATTGCTTTI AAATTGCTGG AGAGGCGCGT GTAGGAGAGA. AAGAGTAGTA 180 

ACATGGCGGA ATCATCAAAA ACGATGTGC GTAAGTAGAG AGGAGGGAAA CATCGAGAC3 24:) 

TTGAGGGTTG CGACGGCCAA GATTATGTAC ATTTACCTGA ATTAGGAIAA GACTTCA1 AT 300 

GGTATAAAGT CGTGGCGTTG TTGGTGGTTA TAACAAGCAA CGGTGACGAT GTCTCAGTCJ 360 

ACACTGCTAC AATCAAAGAG TTTTACAGGT ACTTGTGGA7 ATTTGTTCCT GTGAGTTTG1 420 

TTTCTATTA- AATTTATTTT GTCTCAATTT TTTGTTTCCC C3CTTCCTAC GGTCTCTTTT 480 

TT7CTTCGT' CTTGAAATTT CAATTATTGC TTAACCACAA GCATCCAG1A CTTCAACCTC 540 

CCCA1CAAAT GGTGTCGCTG AA.GCTGC.AfIG CTCGTTTGGC GGCGGACATT CICCGCTGCG 600 

GI'CGCCACCG TGTGTGGCTG GACCCTAATG AGGCCTCTGA GATTTCCAAT GCAAACTCGC 660 

GCAAGAGCGT GCGCAAGTTG ATCAAGGATG G r CTGATTAT TCGCAAGCCI GTCAAGGTGC <2C 
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ACTCGCGCTC CCGC1GGC3C CACATGAAGG AGGCGAAGAC CAT66GCCGC CACGAGGGCG 780 

CTGGGCGCCG CGAGGGTACC CGCGAAGCCC GCAIGCCGAG CAAGGAGCTG TGGA1GCGCC 840 

GTCTGCGCAT TCTCCGCCGC CTGCTGCGCA AGTACCGCGA GGAGAAGAAG AT1GACCGCC 90G 

ACATTTACCG CGAGCTGTAC GTGAAGGCGA AGGGGAACGT GTTTCGCAAC AAGCG1 AACC 960 

TCATGGAGCA CATCCACAAG GTGAAGAACG AGAAGAAGAA GGAAAGGCAG CTGCCTGASC .1020 

AGCTCGCGGC GAAGCGCCTG AAGGATGA.GC AGCACCGTCA CAAGGCCCGC AAGCAGGAGC 1080 

TGCGTAAGCG CGAGAAGGAC CGCGAGCG"*3 CGCGTCGCGA AGATGCTGCC GCTGCCSCCG 1140 

CCGCGAAGCA GAAAGCTGCT GCGAAGAAGG CCGCTGCTCC CTCTGGCAAG AAGTCC3CGA 12C0 

AGGCTGCTGC ACCCGCGAAG GCTGCTGCT3 CACCCGCGAA GGCCGCTGCT CCACCCGCGA 1260 

AGACCGCIGC TGCACCCGCG AAGGCTGCTG CACCTGCCAA GGCTGCTGCT CCACCCGCGA 1320 

AGGCtGClGC TCCACCCGCG MGACCGCTG CTCCACCCGC GAAGACCGCT GCTCCACCCG 1380 

CGAAGGC1GC TGCTCCACCC GCGAAGGCCG CTGCTCCACC CGCGAAGGCC GCTGC T CCAC .440 

CCGCGAAGGC CGCTGCTGCA CCCGCGAAGG CCGCTGCTCC ACCCGCGAAG GGGCTGCTC 1500 

CACCCGCGAA GGCCGCTGCT CCACCCGCGA AGGCTGCTGC TCCACCCGCG AAGGCTGCTG ibbO 

CTCCACCCGC GAAGGCTGCT GCTGCTCCCG TTGGAAAGAA GGCTGGTGGC AAGAAGIGAA 1620 

GCGCGCACTA GTACGACCAA Cl IGTTTTTT TTTTTGGTAT TTAATATTTT CIGAGGAAGA 1680 

AGTGGGTATT GAGGGTCTTT CTTTCCGCGT TTGTGTTGCT TTGTGGTGTT CGTGACATTA 1740 

TAGTAGATCC AAAGTATCT TCAGTGTCCC TTTCCTTTT CTCCATCCTT TTTCC'I ATTT 1800 



Printed from Mimosa 01/07/09 1 1 :43:50 Page: 44 



WO 97/11 8475 »CT/US96/18624 

43 

T1TGTTTGTC "TCTCTACGA TCTTTGTTGT CGTGIGACCT CCGCTGTATG GAACTGACGG I860 

CCGGCGIIGT GAGAGACGAT GTCGCACGIC ACGGCGGACC TGGAGTATTT TAAATGTGAC 19?0 

ATGTGCGGGG TGTAI'CTGCA CAAAGACATC "TTTCCGACC ATCGACG1GA GTGTAAAGGC 1980 

CTTGATTCGA AAGAGCTGAA GAAGAGCCAG TGTCGTCAGA TCGGGATGGC ATTAGACAAG ?040 

GAGGCACGGC ACCGAATTGC G1CACGAATG GC T GATGGAG CAACTCTCGT GCCTGTCGAG 2100 

CTTGCAGAAC GACA^CAACA GGCGCGTGTG CGGCGTAATG TGGC 2144 

(2) INFORMATION FOR SrQ 10 NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 456 base pairs 
(R) TYP:: nucleic acid 

(C) STRANDEDNESS- single 

(D) TOPOLOGY: linear 

(xl) SCQUCNCE DESCRIPTION: SFQ ID NO: 15: 

TGTGCT3CAG AAGGAGAGGG ATGAACCCGT GGCGGAGAAT GCCCAGCTGC AGAAGGAGAG 60 

GGA1GACGCC GTG3CGGAGA AT3CCCAGCT GCAGAAGGAG AG3GAT6ACG CCGTGGCGGA 170 

GAATGCCCAG CTGCAGAAGG AGAGGGATGA CGCCGTGGCG GAGMTGCCC AGCTGCAGAA 180 

GGAGAGGGAT GACGCCGTGG CGGAGAATGC CCAGCTGCAG AAGGAGAGGG ACGAAGCCGT 240 

GGCGGAGAAT GCCCAGCTGC AGAGGGAGAG 3GATGACGCC GTGGCGGAGG ATGCCCAGCT 300 

GCAGAAGGAG AGGGATGAAG CCGTGGCGGA GAATGCCCAG CTGCAGAGGG AGAGGGATGA 360 

AGCCGTGGCG GAGAATGCCC AGCTGCAGAA GGAGAGGGAT GACGTCGTGG CGGAG.AATGC 420 
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CCAGCTGCAG AAGGAGAGGG A1GACGCC65 GGCGGA /]56 
(2) INFORMATION TOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2446 base pairs 

(B) TYPE: nucleic acid 

(C) S fRANUEDNESS : single 

(D) TOPOLOGY ■ linear 

(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 17: 
TGAAGGCCGT TGATCCTTTT CAGGGAACGA CACCGCCGCC CIATAAATGG CAAGAAATGA 60 

CTGGATCTGA GGCGGCAGCC GGCTCGCTTT GTGTACCCA3 CCTTGCTGAG GTGGCCGGCG 120 

GTGTGTTTGC CGTTGCTGAA. GC1CAGCGCA GTGAAAGGGA CGAAGCCTGC GGCCATGCTG 180 

CGATTGCAAC AACGCACATT GAGACGGGCG G'GGTGGCIC AAAGGCGATC TCGGCGATGG 240 

ATGCAGGCGT TTT1 C1CGTA GAACTTGTGG ATGCCGCCAG TGGTACGATC AGGACACGAG 300 

AAAAGATGCA GCCAACGACA ATTGTGAGCG GCGACACTA* CTACATGGCC CTTGGGGACT 360 

ACGAGAAGAA GACGTCTGGG GGTCGGGCTG CCGATGCAGA TGGC I GGAGG CTTTTACTGA 420 

TGAGGGG.AAC TCTCACTGAG GATGGTGGGC AGAAGAAAAT CATG7GGGGT SAIATCCGTG 4B0 

CAGTGGACCC TGTGGCCATC GGGCTTACTC AATTCCTGAA GAGGGTGATC GGTGGCGGAG 540 

GATCGGGTGT TGTGACGAAG AACGG I IACC 1 1GTGCTTCC CATGCAGGCA GTAGAAAAGG 600 

ATGGAAGGAG TGTTGTACTG TCCATGCGII ICAACATGCG TATACAAGCA TGCGAGCTCT 660 

CGTCCGGTAC GACAGGTAGT AACTGCAAGG AACCATCCAT CGCGAATTTG GAAGGAAATC 720 

TAATTTTAAT TACTTCTTGC GCTGCCGGCT ACIACGAAG1 A'TCAGGTCC CTTGACTCTG 78fi 



Printed from Mimosa 01/07/09 11:43:51 Page: 46 



WO 97/18475 



PCTYUS96/18624 



45 

GGACAAGTTG GGAAATGAG" GGTAGGCCAA TTAGTCGCGT GTGGGGCAAf. TCGTA 1 GSTC 840 

GAAAAGGG I A TGGCGTTCGC TGTGGCCTCA CCACCGTAAC CATTGAGGGA AGGGAAGTGC 900 

TGCTTGTTAC CACGCCAGTG TATTTGGAGG AGAAAAAIGG TAGGGGTCGG CTTCATCTTT 960 

GGGT6ACGGA CGGTGCACGT GTGCATGATG CTGGGCCGAT ATCCGATGCA GCTGATGACG 1020 

GTGCTGCCAG TTCCCTGTTG TATAGCAGTG GGGGCAATCT GAHTCGCTG TACGAGAATA 1080 

AGAGTGAGGG GTCATACGGT CTTGTTGCTG TGCACGTCAC TAC6CAGCTG GASCGGATAA 1140 

AGACTGTGT7 GAAGAGG'IGG CAGGAGTTGG ATGAAGCCCT AAGAACGTGC AGATCCACTG 1200 

CCACTATCGA CCCGGTGAGA AGGGGCATGT GTATTCGTCC CATTCTIACT GACGGGCTTG 1260 

TTGGCTATTT GTCTGGTC'IG TCGACTGGGA GTGAGTGGAT GGACGAGTAC CTCTGCGTGA 1320 

ACGCAACTG7 TCATGGGACG GTGAGAGGGT TCTCCAATGG AGTGACGTTI GAAGGACCCG 1380 

GAGCAGGGGC GGGGTGGCCT GTTGCCCGAA GTGGACAGAA TCAACCGTAC CATTTCI I AC 1440 

ACAAAACGTT CACTCTAGTG GTGATGGCGG ICATCCACGA TAGGCCGAAG AAACGCACCC 1500 

CCATTCC TTT GATTCGTGTG G TGATGCATG ACAATGACAA GAC1GIGCTA TTT3GTGT3T 1550 

TTTACACCCA TGATGGGAGG TGGATGACTG TAATTCATAG TGGCGGTAGA CAAATACTTT 16?0 

CAACAGGGTG GGACCC.AGAA AAACCGTGTC AGGTAGTGCT GOJACACGAC ACGGGCCATT 1680 

GGGATTTCTA CGTTAACGCG AGG.AAGGCTT ACTTTGGCAC CTACAAGGGT CTCTTCTCCA 2740 

AACAAACAGT ATTTCACACA "JCCAATTCCA CGGGGAGAGT GGGGAAGTTG CAGAGTCCAG 1800 

CCATTTGTCA CTCTTCAACG CCCGTTTGTA TAACCGAAGA CTCAATTCCA AGCATfJAAG 1860 



Printed from Mimosa 01/07/09 1 1 :43:51 Page: 47 



WO 97/18475 



PCTYIIS96/1R624 



46 

ATGGCTCATG GTCGGCGAGA CAGGCCCAAA ATACGATGAT GGCAGCTCTT ATTCTGCGAG 1920 

TGCGTCCGAG GAAGGAAGCA GAGGTGGCAG CTCCATGCCC GCGGGTACGT CCGAGGAAGG 1980 

AAGCAGAGGT GGCAGCTCCA TGCCTGCGGG TACGTCCGAG GAAGGAAGCA GAGGAGGCAG 2040 

CTCCATGCCT GCGGGTACGT CCGAGGAAGG AAGCAGAGGA GGCAGCTCCA TGCCTGCGGG 2100 

TACGTCCGAG GAAGGAA.GCA GAGGTGGCAG CTCCATGCC1 GCGGGCACTT CCGAAGAAGG 2160 

AA.GCAGAAGT GGCANCTCCA TGCCTTCGGG CTCTTCCGAA. GAAGGAAGCA GAAGAGGCCG 2220 

CTCCCTGCCT TCGGGTTCTT CCGAAGGAAG GAAGCAGAGG AGGCCCICCC TGCC1GCGGG 2280 

TTCTTCCGAA GAAGGAAACA GAAGTGGCNC TCCATGCCCG CGGGTTCTTC CGAGGAAGGA 234C 

ACCAGAAGAA GCNCTCCCTG CCCGCNGGTT CNTCCNAAGA AAGAAACAN'A AGTTGGCCNC 2400 

TCCCNGCCCC NNGTTTCTTC CNAANGAAAG AAAC.AAAAGT GGCCCC 2446 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SFQUENCF CHARACTERISTICS: 

(A) LENGTH: 345 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNLSS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GGGTACGTCC GAGGAAGGAA GCA6AGGT5G CAGCTCCA'IG CCTGCGGGTA CGTCCGAGGA 60 

AGGAAGCAGA GGTGTCAGCT CCATGCCTGC GGGTACGTCC GAGGAAGGAA ACAGAGGAGG 120 

CAACTCCATG CCTGCGGGTA CGTCGAGGA AGGAAGCAGA GGTGGCAGCT CCATGCCTTC 180 
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GGGCACCTCC GAGGAAGGAA GCAGAGGTGG CAGCTCCATG CCTTCGGGTA CGTCCGAGGA 240 

AGGAAGCAGA GGAGGCAGCT CCATGCCTGC GGGTACGTCC GAGGAAGGAA GCAGAGGTGG 300 

CAGCTCCAfG CCCGCGGGTA CGTCCGAGGA AGGAAGCAGA GGCCG 34S 

(?) INFORMATION TOR SEQ ID NO: 19: 

(i) SFQUENCE CHARACTERISTICS: 
(A) LENGTH. 835 base pairs 
CR) TYPE: rucitfic acid 
CO STRANDEONl SS : single 
(D) TOPOLOGY: linear 

(xi) StOUENCE DESCRIPTION: SEQ ID NO: 19: 

GGCACGAGCT GTACTATATT GTAGGAGAGC AGCCA73GGT ATCGTTCGCA GCCGCCTGCA 60 

TAAACGCAAG ATCACCGGTG GAAAGACGAA GATCCACCGG AAGCGCATGA AGGCCGAACT 1?0 

CGGCCGTCTT CCCGCGCACA CGAAGCTTGG CGCCCGCCGC GTGAGTCCCG TCCGCGCCCG 180 

CGGTGGGAAC TTCAAGCTCC GCGGTCTTCG CCTGGACACC GGCAATTTTG CGTGGAGCAC 240 

AGAAGCCATT GCTCAGCGGG CCCGTA1CCT CGACGTTGTG TACAACGCCA C1TCTAACGA 300 

GCTGGTGCGC ACGAAGACGC TGTGAAGAA CTGCATTGTT GTGGTGGACG C.CGCGCCCTT 360 

CA4GTTATGG TACGCGAAGC AC TACGGTAT CGACCTTGAG CCGCGAAGAG CAAGAAGACG 420 

CTGCAGAGCA CGACGGAGAA GAAGAAGTCG AAGAAGACCT CACACGCCAT GACIGAGAAG 480 

IACGACGTCA AGAAGGCCTC CGACGAGCTG AAGCGCAAGT GGATGCTCCG CCGCGAGAAC 540 

CACAAGATTG AGAAGGCAGT TGCTGATCAG CTCAAGGAG3 GCCGTCTGCI CGGCCGCATC 600 
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ACGAGCCGCC C~GGCCAGAC AGCCOXfiCC GATGGTGCAC 7GOGGAGGG CGCCGAACTG GGG 

CAGTTCTATC TGAAGAAGCT CGAGAAGAAG AAGCGGTAGA GAAGGATG'f I CGGGAGACGG 720 

GAGGAGGCGC CACCACCACC ACTCATGGTG ATGCACCCAC TACCTACTTT GTTTTCATTT 780 

TTTGTTTTAC CTCTAATTTT TTAGGCCAGA GGGGGGGAAA AAAAAAAAAA AAAAA 835 
(2) INF0RMAI10N FOR SfcQ ID NO:20: 

(i) SFQUENCE CHARACTERISTICS: 

(A) LENGTH: 555 base oairs 

(B) TYPE: nucleic acic 

(C) STRANDEDNESS : single 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SCO ID NO: 20: 

GGCACGAGAA AAAAGAAAAC AAACAAATAA AATCAAAAAC AGTAAATCCA TCAC f'TCAAC GO 

AATGAGCATT GAGAGCGCCT TTTACGCCTT TGCCTCCTTT GGTGGTGCGC CCACGAAAGA i?0 

GATGGACAAT GCTCACTTCT CCAAGATGC1 GAAGGAGACG AAGGTCAT1G GAAAGCAATT 180 

CACCACCACC GACCCCGATC TTCTCTTCAA CAAAGTGAAG GCAAAGGGAfi CCCGCAAAAT 240 

TACATTGTCG GATTTTGTTG ACAAGGCTGT TCCTGAGATT GCATCAAAGT TAAAGAAGTC 300 

CGCGGAGGAA TTGATCGCAG ATATTTCAAG TTGCTCTCCC GAGGCACGCG CAACCAAGGC 360 

CGATGCAGT1 AAGTTCCACG ACGATAAGAA CATGTACACT GGTGTCTACA AGGCCGGCGG 420 

GCCAACAAAC GTGGATCGCA ACTCCGGCTC CCTTTCAGGI GTCGTGGATC GCCG TGTGGC 480 

GCAGACTGAC GTTCGTGGCA CGAC'IGCI 1C CCAGAAGTAA AGAGGGAAAC GAAATGGAAA 54 2 
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AAAAAAAAAA AAAAA 555 
(2) INFORMATION FOR SEQ ID N0:21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 936 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDLDNESS: single 
CD) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

GGCACGAGA6 CTCTCTTCGT CAGTCATGAC 6CTCQGGAA6 AACAAGCGCA Tf^AGCAAGGG 60 

CQGCAAGCGC GGCAAGAAGA AGACCCAGGA GACGATGAGC CGCAAGGAGT GGTACGATGT 120 

GGTTGCCCCC AAGAAC 1 1 !G AGGTGCGCCA 3TTTGGCAAG ACCATCTGCA ACAAGACCCA 180 

GGGCACAAAG ATCGCGGCGG ACIACCTGCG f.GGGCGCGTG TACGAAAGCA ACCT T GCGGA ?40 

TCTGAACAA.G ACGC.AAGGCG ACGACGACGC CTACCGCAAG GTGAAG" r TTG TTGTGCAGGA 300 

GGTGCAGGGC CGCAACCTGC TTACGCAGT1 CCACAGCATG GAAATGACAT CT3ACCGCGT 360 

GTACrTTHG CTGCGCAAGT GGTGCACGAC GATCGAGGCG GCAGTGGAGA CGAAGACTGC A20 

GGAC3GCTAC ACCCTGCGCC TCTTCSTGAT TGCCTTCACG AAGAAGCAGA GCAACCAGCT 480 

GTCG.AAGAAC TGC TATGCCA .AGACGCGCCT GGTGAAGTGG GTGCGCCATC GCATCACGAA 5-0 

CCTCATCCGC CAGCGCCTGT CGAAGGTGAA CATCAACGAG GCGGTGACGC TGCTGACACG 603 

CAACATCCTG CGCGATCGTC TGGCAAAGCG CTGCAACCCC ATCGIGCCGC TGCGCGATCT 660 

CCGCATCCGC AAGGTGAAGG TGGTCCGCAC CCCCCGGTTT TGACGCCCAG GCGCTTCTGA 7?n 
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ATGCACACGG CGAGATCCCC GCCTCGGCT6 AGGG1 GAGGC ACGCGTCGTC GAGGAAGCCC 7R0 

MGAGGCTCC CGCCGCTGAA GCCACAGCC1 AAGCCTTCCA TGTGGAGGAA GGATGTG7GA 810 

TGTGAAAGCT CTTTGTTCTT TTTCTTTCT ATTTTGAAAC GGTGATTCCG CA1ATATATA 930 

TTAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 936 

(2) INFORMATION FOR SEQ II) NO: 22: 

(i) SEQUCNCE CHARACTERISTICS: 
(A) LENGTH: 581 base pair?; 
CB) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SFQ ID NO:22: 
GTACTATATI GTTGCTATTA ACACACTGTT AGGAACGCGA AACCATGCAG ATCT1 CGTGA. 60 

AGACACTGAC GGGCAAGACG ATCGCGC-CG AGGTGGAA'IC CAGCGACACC A 1 1GAGAACG 120 

rUAAGGCGAA GATCCAGGAC AAGGAGGGCA TTCCGCCGGA CCAGCAGCGC CTGATCTTCG 180 

CTGGCAAGCA GCTGGAGGAC GGCCGCACGC TCGCAGACTA CAACATCCAG AAGGAGTCCA 240 

CGCTGCACCT TGTGCTGCGC CTGCGCGGTG GTGTGATGGA GCCGACACTT GAGGCCCTGG 3 GO 

CGAAGAAGTA CAACTGGGAG AAGAAGGTAT GCCGCCGCTG CTACGCCCGT CTGCCGGTGC 360 

GTGCGTCCAA CTGCCGCAAG AAGGCATG7G GCCACTGC1C CAACCTCCGC ATGAAGAAGA ^20 

AGCTGCGGTA GTCTGCGATG CTGTGGACCG ACGCATTGAA ATACACACCG TCTTCGGCGT 480 

TCCTTTTTTT TATATGTCTT 11.111 TATT GAGAAGATGT CTTGTTTGTT GTTGTTTTTT 540 
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TTTCAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA A 
(2) INFORMATION FOR SEQ :U NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 ammo acids 

(B) TYPE: amino acd 

(C) STRANDFDNESS: 

(D) TOPOLOGY: I inea r 

(xi) SrOUENCE DESCRIPTION: SEO ID ND:23; 

Leu Pro Pro Leu Leu Pro Ser Ser Asp Val Pro Glu Gly Met Glu Leu 
1 5 10 15 

Pro Pro Icj Leu Pro Ser Ser Asp lie Pro Glu G"'yMet 3 In 
23 25 ~ 30 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: ammo dcid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUF.NCE DESCRIPTION: SEQ ID N0:24: 

Gly Cys Leu Leu Cys Phe P-o Arg Tfir Cys Pre Lys Ala Trp Se- 
15 10 15 

Cys Leu Leu Cys Phe Leu Pro Arg Thr Tyr Pro Lys Ala Trp Ser Cys 
20 25 30 
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His Leu Cys Phe Leu Pro Arg Ihr Tyr Pro Arg Ala Trp Stir Cys His 
35 40 45 

Leu Cys Phe Leu Pro Arg Tt Cys Pro Lys A" a >p Scr Cys His I. en 
50 55 60 

Cys Phe leu Pro Arg Ihr lyr Pre Arc] Ala Irp Ser Cys His Leu Cys 
65 70 75 BO 

Phe Leu Pro Arg ihr Tyr Pro Arg V'al Trp 
85 90 

<2> INFORMATION I- OR SEQ ID NO: 25: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LLNGIH: 90 amino acids 

(B) TYPE: amino acid 
(C3 STRANOEDNESS: 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

Ala Ala Ser Ser Ala Ser Phe teu Gly Arg Ala Arg Arg His Gly Aia 
15 i: 15 

Ala Ser Ser Ala Ser Phe Leu Gly His Tnr Arg Arg His Cy Ala Ala 
20 25 30 

Thr Ser Ala Ser Phe Leu Gly Arg Ihr Arg Gly -Ts Gly Ala Ala Ihr 
35 40 45 

Ser Ala Ser Phe Leu Gly Arg Ala Arg Arg His Gly Ala Ala Thr Scr 
50 55 60 

Ala Ser Phe Leu Gly Arg Trr Arg Gly His Gly Ala Aia Thr Ser Ala 
65 70 75 80 
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Sc Phc lcu Gly Arg Thr Arg Gly Ms Gly 
85 90 

(?) INFORMATION KOR SI.Q ID N0:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) IYPF: amino acid 

(C) STRANDrDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:26: 

Ser Val Pro Gly Lys Arg Leu A-g Asn Ser His Gly Lys Ser Leu Arg 
1 a 10 15 

Asn Val His Gly Lys Arg Pro Lys Asn Glu His Gly Lys Arg Lei; Arg 
20 ?.b 30 

Ser Val Pro Asn G^u Arg Leu Arg 
35 40 

(2) INFORMATION TOR SLQ ID NC:27: 

(-) SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 40 amino acids 

(B) TYPL: amino acid 

(C) STRANDEDNESS: 
(2) TOPOLOGY linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID N0:27: 

Glu Ala Glu Glu Leu Ala Arg Gin Glu Ser GIj Glu Arg Ala Arg G7n 
1 5 10 15 
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G"u Ala Glu Glu Arg Ala Trp (nr. Glu Ala Glu Glu A<*g Ala Gin Arg 
ZZ 25 30 

Glu Ala 6'ili Glu Arg Ala Gin Ary 
35 40 

(2) INFORMATION FOR SEQ ID N0:?8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 56 amino acids 
CB) TYPE: amino acid 
(C) STRANDEDNESS: 
CD) TOPOLOGY: 1 inear 

(xi) SEQUENCfc UL5CK1 P I ION : StCj ID NO:28: 

Scr Trp Gin Ser Gin Ser Hi:; Glr Gin Gin Val Pre Thr Cys Aid Arg 
1 5 10 15 

Gin Ser Ary Ser His Gin Gin Glr Ala Pro Lys ~rp Ser Trp Gin Ser 
20 25 30 

Gin Ser His Gin Gin Gin Val Pro Thr Cys Ale Ary Glr Ser Arg Ser 
35 40 4b 

His Gin Gin Gin Val Pro In- Trp 
50 55 

(2) iNI ORMAI ION FOR SLO IU N0:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 
CD) TOPOLOGY: linear 
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Ui> SEQUENCE DESCRIPTION : SEO ID NO: 29: 

Gly Arg Gly Arg A" a Lys Ala Thr Asr 5er A~g Cys Arg Arg Val Arg 
15 1C 15 

G'ly Ary Ala Glu Ala Thr Scr Ser Arg Arg Arg Ser Gly Arg Gly Arg 
?0 25 30 

Ala Lys Ala Thr Snr Se- Arg Cys Arg Pro Val Ar-g Gly Arg Ala Glu 
35 40 45 

Ala Thr Asn Ser Arg Cys A r g Arg 
SO 55 

(2) INFORMATION FOR SEC ID N0:30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LLNGiH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 
(0) TOPOLOGY: linear 

(xi) SEQUENCC DESCRIPTION: SEQ 10 NO: 30: 

Val Val Ala Glu Pro Lys Pro Pro Thr Ala Gly Ala Asp Val Cys A" a 
1 5 10 15 

Ala Glu. Pre lys Pro Pro Ala Ala Gly Ala Giu Val Val Val Ala Glu 
20 25 30 

Pro Lys Pro Pro Ala Ala Gly Ala Asp Val Cys Ala Ala Glu Pro lys 
35 40 -15 

Pro Pro "Hir Ala Gly Ala Asp Val 
50 55 
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(2) 1 FORMATION TOR SEC It) NO:31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRAN3EUNLSS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:31: 

Pro Pro Ala Lys Ala Ala Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 32: 

<U SfcQUhNO OARACTERlSliCS: 

(A) LENGTH: 151 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEOUENCE DESCRIPTION: SEO ID N0:32: 

Val Leu Gin Lys Glu Arg Asp Glu Ala Va" Ala Glu As.n Ala G*n Lou 
15 10 15 

Gin Lys Glu Arg Asp Asp Ala Val Ala Glu Asn Ala Gin Leu Gin Lys 
20 25 30 

Glu Arg Asp Aso Ala Val Ala Glu Asn Ala Gin Leu Gin lys Glu Arg 
35 4C 45 

Asp Asp Ala Val Ala Glu Asn Ala Gin Leu Gin Lys Glj Arg Asp Asp 
50 55 60 
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Aia Val Ala Glu Asn Ala Gin Leu Gin . ys Glu Arg Asp Glu A'ia Val 
65 70 75 60 

Ala Gin Asn Ala Gin Leu Gin Arg Glu Arg Asp Asp Ala Val Ala Glu 
35 9C 95 

Asd Ala Gin Leu Gin Lys Glu Arg Asp Glu Ale Val Ala G"iu Asn Ala 

ioo 105 no 

Gin leu Gin Arg Glu Arg Asd Glu A!a Val Ala Glu Asn A'-d Gin Leu 
115 120 125 

Gin Lys Glu Arg Asp Asp Val Val Ala Glu Asn Ala Gin leu Gin lys 
130 135 L43 

Glu Arg Asp Asp Ala Val Ala 
145 150 

(?) IN FORM A I ION FOR SEQ ID N0:33: 

(U SEQUENCE CHARACTERISTICS: 
(AJ LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDfcDNfcSS : 
CD) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Cys Arg Arg Arg Gly Met Lys Pre Trp Arg Arg Met Pro Ser Cys Arg 
1 5 in is 

Arg Arg Gly Met Thr Pro Trp Arg Arg Met Pro Ser Cys Arg Arg Arg 
20 25 30 

Gly Met Thr Pro Trp Arg Arg Met Pro Ser Cys Arg Arg Arg Gly Met 
35 40 45 
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Thr Pro "irp Arc Arg Met Pro Ser Cys Arg Arg Arg Gly Met Thr Pry 
50 55 60 

Trp Arg Ary Met Pro Ser- Cys Arg Arc; Arg Gly Thr Lys Pre Trp A"-y 
65 70 75 80 

Arg Met, Pro Ser Cys Arg Gly Arg Gly Met lhr Pro Iro Arg Arg Met 
85 90 9i 

Pro Ser Cys Arg Arg Arg Gly Met Lys Pro Trp Arg Arg Met Pro Ser 
100 10o 110 

Cys Arg Gly Arg Gly Met Lys Pro Trp Arg Arg Met Pro Ser Cys Arg 
115 12C 125 

Arg Arc Gly Met Thr Se- Trp Arg Arg Met Pre Ser 
130 135 14C 

(2) INFORMATION FOR SEQ ID N0:34: 

(i) SFQIJFNCF CHARACTFRISTTCS: 
(A) LENGTH: 60 amino acids 
(R) TYPF: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:34: 

Gly Tyr Val Arg Gly Arg Lys Gin Arg T-p Gin Leu kis A" a Phe G"y 
15 10 15 

Tyr Val Arg Gly Arg Lys Gin Arg Trp G'n Leu His Ala Phe Gly Tyr 
20 25 30 
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Val Arg Gly Arg I ys Gin Arg Arg Gin Leu His Ala Cys Gly Tyr Val 
35 40 45 

Arg Gly Ary lys Gin Arg Trp Gin Leu His Ala Cys 
50 5b 60 

(2) INFORMATION FOR Srq ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 60 amino acids 
(13) IYPL: amino acid 
tC) STRANDEDNFSS : 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION SEQ ID NO: 35: 

Gly Jrr Ser Glu Glu Gly Ser Arg Gly Gly Ser Ser KeL Pro Ser Gly 
1 5 10 15 

Thr Ser Glu Glu Gly Ser Arg Gly Gly Ser Ser Met Pro Scr G:y Thr 
?0 25 30 

Scr Glu Glu Gly Ser Arg Gly Gly Ser Ser Met Pro Ala Gly Thr Ser 
35 40 45 

Glu Glu Gly Ser Arg Gly Gly Ser Ser Met Pro Aid 
50 55 60 

(2> INrORMAI ION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTFRISTICS : 

(A) LENGTH: 60 amino acids 

(B) TY^E: amino acid 

(C) STRANDEDNESS: 
(0) IOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SLO ID HQ :3b: 

Val Arg Pro Arg Lys Glu Ala Glu Val Ala Ala Pro Cys Leu Arg Val 
1 5 10 lb 

Arg Pro Arg Lys Glu Ala Glu Val Ala Ala f >r X) Cys Leu Arg Va" Arc; 
20 25 30 

Pro Arg Lys Glu Ala Glu G-u Ala Ala Pro Cys Leu Arg val Arg Pro 
35 40 45 

Arg Lys Glu Ala Glu Val A" a Ala Pro Cys Leu Arg 
50 55 CC 

(2) INFORMATION FOR SFQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LLNGIH: 639 amino acids 
(R) TYPE: amino add 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SCO ID N0:37: 

Asp Ala Ser Val Val Asp Leu Gly Gly Glu Ala Ms Gly Thr His "fyr 
15 10 15 

Ala Phe Leu Pro Asp Val lie lys Gly lie Ala Gin Glu Glu Leu Tyr 
20 ?$ 30 

Leu Glu Asp Asp Ala Tyr Phe Gin Glu Leu Leu Ala Arg Tyr Lys Glj 
35 40 45 

Leu Val Pro Val Gly Ala Glu Pro Thr Glu Pro Arg Ala Lys Gin Leu 
50 55 60 
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Arg Glu Gin Mer. Arg He Arg Ala Gly G"n Leu Ala Veil Asp Thr Arg 
65 70 75 SO 

Lys Leu His Ala Ala Glu Glu Arg Ala Ala Ser Arg Met Aia Ihr Leu 
85 90 95 

Tyr Pro Phe Va: Gly Ser Ala Pro Leu Gly Vat Ala Leu Trp Asn lie 
100 105 no 

Pro Val Glu Aia Asp Glu Glu Phe Cys Ala Leu Leu Leu ^ys Arg Glu 
115 120 l?b 

Glu Ala Leu Ala Gly Lys Scr Gly Ser Val His Glu Val Gu Ser Ala 
130 135 140 

Leu Ser Ala Arg Ala Glu Ala Met Ala Lys Ala Val Leu Glu Glu Glu 
145 150 15b 160 



Glu Ala Leu Ala Ala Ala Phe Pro Phe Le.j Gly Arg Ser Vnl Lys Gly 



165 



170 



175 



Ala Pro Leu Arg Glu Leu Ala Leu Met Ser Asp Pro Asn Pne Ala Glu 
160 185 190 

Leu Ala Thr Arg His Ala Gin Glu Ala Thr Ser Gly Asp Ala Ala Gly 
195 200 205 

Jle Leu Arg Leu Glu 61 n Glu Leu Arg Asp Gin Ala Cys Arg He Ala 
210 21b 220 

Arg Glu Val Arg Val Ala Arg Arg Leu Asp Aia Xaa Arg Asn Glu Asp 
225 230 23b 240 



Leu Hts Glu Arg Tyr Pro Phe Leu Pro Glu Glu Pro Val Arg Gly He 
?45 250 255 
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Leu Leu Gly Ala Val Arg Pro Val Gin Gin Pro Ala Phe Arc: Glu Leu 

260 265 27 Z 

Scr Asn Lys Leu Asp Glu Gin Arg Arg Asp Pre Thr Arg Asn Ala Aia 

275 280 285 

Ala He Arg Ihr Thr Glu Glu Gin Met Thr Ala Leu Val Val Ary Leu 

290 295 300 

Ala Glu Glu Arg Ala Glu Ala Thr Glu Arg Ala H ; s Glu Gin Tyr Pro 

305 310 315 320 

Phe Leu Pro Arg Arg Val leu Gly Val Arg leu Gly Asp He Ser Leu 

3?5 330 335 

Gin Glu Asp Aso Val Leu Ser Gin Leu Ala Arq Arg A^g Val Arg Gin 

340 345 350 

Leu Arg Asn Ser Lys Thr Ala lie Asp Ala His A" a Thr Glu Glu Glu 

355 360 365 

Met He Arg Arg Ala Glu Glu Leu Ala Arg Asn Val Lys Leu Val Asp 

370 375 380 

Ala Tyr Arg G^y Asn Gly Asn Glu Tyr Val Arg Ala Cys Asn Pro Phe 

385 390 395 400 

Leu Val Tyr Glu Asp Arg Lys Cys Val Leu Leu Ser Glu Leu Pro Leu 

40b 410 4] 5 

Aia Gly Gly Asp Val Tyr Gin Gly Leu Phe Arg Asn Tyr Leu 'In- Ala 

420 425 43C 



Leu Glu Asp Ala Glu Ala Asn Ala Pro Arg He Ala Glu Leu Glu Asn 
435 440 445 



Printed from Mimosa 01/07/09 11:43:56 Page: 64 



WO 97/1 847S 



CT/US96/18624 



63 



Ala Leu Arg Ser Arg Ala Asp Glu Leu Ala Leu Glu Val Cys Glu Arg 
450 455 460 

Asp Ala Arg Leu Leu His Tyr Ser Phc Leu Ser Ala Gin Asp Val Pro 
465 470 475 480 



Gly Trj Ser Glu Ala Leu Leu His Asp Ala Glu Phe Gin Gin Leu Arq 



485 



490 



Glu Arg Tyr Glu Glu Leu Ser Lys Asp Pro Gin Gly Asn Ala Glu Ala 
500 50b 530 

Leu Arg Glu Leu Glu Asp Ala Met Glu Ala Arg Ser Arg Ala He Ala 
515 520 525 

Glu Ala Leu Arg Thr Ala Glu Xaa Thr Asn Xaa !hr Glu Gin Ala Arg 
530 535 540 

Leu Lys ihr P-o Ser G"n Ala Gly Ser Gly Val Ser Ala Gly Asp Arg 
54b 5b0 555 560 

Met His Gly Ser Glu His Ala Asp Leu Ala His Glu Gly Gly Ser !hr 
555 5/0 575 

Ala Gly Gly Thr Met Arg Gly Ala Glu Ser Val Ser Lys Ser Ser Gly 
580 585 590 

Lys His Ser Xaa Arg Ser Val Ser His Ala Ser Val Val Asp leu Gly 
595 600 605 

Gly Giu Ala His Gly Thr His Tyr Ala Phe Leu Pro Asp Vel lie Lys 
610 615 620 



Gly He Ala Gin Glu Glu Leu Tyr Leu Glu Asp Asd Ala lyr Phe 
625 630 635 
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(2) INFORMATION FOR SEQ 10 NO: 38: 

(!) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23i amino acids 

(B) TYPF: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:38: 

Ala Arg Ala Val Lou "yr Cys Arg Arg Ala Ala Met Gly He \'a'. Arg 
15 10 15 

Ser Arg Leu His Lys Arg Lys He Thr Gly Q'y Lys Thr Lys lie Hir, 
20 25 30 

Arg Lys Arg Met Lys Ala G 1 u Leu Gly Arg Leu Pro Ala His Thr I ys 
35 40 45 

Leu Gly Ala Arg Arg Val Ser Pro Val Arg A;a Arg Gly Gly Asn Phe 
50 55 60 

Lys Leu Arg Gly Leu Arg Leu Asp Thr Gly Asn Phe Ala Irs Ser Thr 
65 70 75 80 

Glu Ala lie Ala Gin Arg Ala Arg He i eu Asp Val Val Tyr Asn Ala 
85 90 95 

Thr Ser Asn Glu Leu Val Arg Thr Lys Thr Leu val Lys Asn Cys He 
100 105 110 

Val Val Val Asp; Ala Ala Pro Phe ;.ys .eu Trp Tyr Ala Lys His Tyr 
115 120 125 

Gly He Asp Leu Asp Ala Ala Lys Ser Lys Lys Ihr Leu Gin Ser Thr 
130 135 140 
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Thr GIij l.ys Lys Lys Sor Lys Lys Thr Scr His A", a Met. Thr Glu Lys 
145 150 155 160 

Tyr Asp Val lys lys Ala Ser Asp Glu Leu Lys Arg Lys T rn Met Leu 
165 170 175 

Arg Arg Glu Asn his lys lie Glu l.ys Ala Val Ala Asp Gin Loj Lys 
180 185 190 

Glu Gly Arg Leu Leu Ala Arg He Thr Ser Arg Pro Gly Thr Ala Arg 
195 200 205 

Ala Asp Gly Ala Leu Leu Glu Gly Ala Glu Leu Gin Phe Tyr Leu Lys 
210 215 220 

Lys Leu Glu Lys Lys Lys Arg 
225 230 

(2) INFORMATION FOR SEQ ID NO:39: 

(i) SEQUENCE: CHARACTERISTICS: 

(A) ..ENGTH: 172 atrino acids 

(B) TYPE: amino ace 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Ala Arg Glu Lys Arg Lys Gin Thr Asn Lys He lys Asn Ser Lys Ser 
15 10 15 

He Thr Ser Thr Met Ser Glu Glu Ser Ala Phe Tyr Ala Phe A". a Ser 
20 25 30 
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Phe Gly Gly Ala Pro lh" Lys Clu Met. Asp Asn Ala His Phe Ser i_ys 
3b 40 45 

Met Leu lys 61 j Tnr Lys Val He Gly Lys Gin Phe Thr 5er Thr Asp 
50 5b 60 

Ala Asp Leu Leu Phe Asn Lys Val Lys Ala Lys Gly Aia Arg Lys He 
65 70 ;/b (30 

Thr Leu Ser Asp Phe Val Asp Lys A" a Va" Pre Glu He Ala Ser Lys 

8b 90 95 

Leu Lys Lys Ser Ala Glu Glu Leu Tie Ala Asp He Ser Ser Cys Se- 
100 105 110 

Pro Glu Ala Arg Ala Thr Lys Ala Ass Ala Val Lys Phe His Asp Asp 
115 1?0 1?5 

Lys Asn Mel Tyr Thr Gly Val Tyr Lys Ala Gly Gly Pro Thr Asn Val 
130 135 140 

Asp Arg Asn Ser Gly Ser leu Ser Gly Val Val Asp Arg Arg Val Ala 
145 150 155 160 

Gin Thr Asp Val Arg Gly Thr Thr A": a Ser Gin Lys 
165 170 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 amino acids 

(B) TYPE: ammo acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(x- j SEQLJFWCF- DESCRIPTION: SEQ ID NO: 40: 

Ale Arg G1j leu Ser Ser Ser Val Mot Thr Leu Gly I ys Asn Lys Arg 
15 10 15 

He Scr Lys Gly Gly lys Arg Gly Lys Lys Lys Thr Gin Glu Thr Met 
20 25 30 

Ser Arg Lys Glu irp Ty- Asp Val Val Ala Pro Lys Asn Phe Glu Val 
35 40 46 

Arg Glr Phe Gly Lys Thr He Cys Asn lys Thr Gin Gly Ihr Lys He 
00 55 60 

Ala Ala Asp Tyr Leu Arg Gly Arg Val Tyr Glu Ser Asn Leu Ala Asp 
6b 70 75 80 

Leu Asr. Lys Thr Glr. Gly Asp Asp Asp Ala Tyr Arg Lys Val lys Phe 
85 90 95 

Val Val Gin Glu Val Gin Gly Arg Asn leu Leu Thr Gin Phe His Ser 
100 10b 110 

Met Glu Met Thr Ser Asp Arg Val Tyr Phe Leu Leu Arg Lys Trp Cys 
115 120 125 

'hr Thr He Glu Ala Ala Val Glu Thr Lys Ihr Ala Asp Gly lyr Thr 
130 135 140 

Leu Arg Leu Phe Val lie Ala Phe Thr Lys Lys Gin Ser Asn Glr; Leu 
145 150 155 160 

Ser lys Asn Cys Tyr Ala Lys Thr Arg Leu Val Lys Trp Val Arg His 
165 170 175 
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Arg lie Thr Asn Leu lie Arg Gin Arg Leu Ser Lys Va I Asn lie Asn 
180 185 190 

Glu Ala Val Thr Leu Leu Thr Arg Asn lie Leu Arg Aso Arg Leu Ala 
195 200 205 

Lys Arg Cys Asn Pro He Val Pro Leu Arg Asp Leu Arg lie Arg Lys 
?10 215 220 

Val Lys Val Val Ary Thr Pro Arg Phe 
225 230 

(?) INFORMATION FOR SE0 ID NO: 41: 

(i) SCQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 
CD) TOPOLOGY: ; inear 

(xi) SEQUENCE INSCRIPTION: SEQ ID N0:41: 

Met Gin lie Phe Val Lys Tlr Leu Thr Gly Lys Ihr He Ala Leu Glu 
15 10 15 

Val Glu Ser Ser Asp Ihr He Glu Asn Val lys Ala Lys He Gin Asd 
20 25 30 

Lys Glu Gly He Pro Pro Asp Gin Gin Arg Leu lie Phe Ala Gly Lys 
35 40 45 

Gin Leu Glu Asp Gly Arg Thr leu Ala Asp Tyr Asn lie Gin Lys Glu 
50 55 50 

Ser Thr Leu His Leu Val Leu Arg Lej Arg Gly Gly Val Met Glu Pro 
65 70 75 80 
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Thr leu Glu Ala Leu Ala Lys :.ys Tyr Asn Trp Glu ^ys Lys Vol Cys 
85 90 95 

Arg Arg Cys lyr Ala Arg Leu Pro Va- Arg Ala Ser Asn Cys Arg Lys 
100 105 110. 

Lys Ala Cys Gly His Cys Ser Asn Leu Arg Met Lys Lys Lys I en Arg 
115 120 125 

(2) INFORMATION FOR SEC ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LbNGiH: 145 ammo acids 
(3) TYPE: amino acd 

(C) STRANDEDNESS : 

(D) TOPOLOGY: 1-r.ear 

(X1) SEQUENCE DESCRIPTION: SEQ 13 NO:42: 

Arg Leu Pro Pro Leu l.cu Pro Ser Ser Asp Val Pro Glu Gly Met Glu 
1 5 10 15 

Leu Pro Pro Leu Leu Pro Ser Ser Asd He Pro Glu G'y Met Glu Leu 
20 25 30 

Pro Pro Leu Leu Pro Ser Ser Asp Val Pro A": a Gly Met Glu Leu Fhr 
35 40 45 

Pro Leu Leu Pro Ser Se- Asp Va! Pro Glu Gly Met Glu Leu Pro Pro 
50 55 60 

Leu Leu Pro Ser Ser Asp Val Pro Ala Gly Met Glu Leu Pry Pr-o Leu 
65 70 75 80 
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Xaa Pro Ser Ser Asp Val Pro Ala Gly Met Gki Lou Pro Pro Leu l.cu 
85 90 9b 

Pro Ser Ser Asp Val Pro Ala Xaa He Glu Leu Pro Pro Leu lie Se- 
100 105 no 

Xaa Leu Gly Arg Thr Xaa Arg Xaa Gly Asp Xaa Ser Ser Xaa Ser Cys 
115 120 12b 

Leu Gly Arg Xaa Xaa Arg Xaa Arg Xaa Ala Pro Leu Xaa Pr-o Xaa Ser 
130 136 140 

Glu 
145 

(2) INFORMATION f-OR SEQ ID N0:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 186 amino acids 

(B) TYPfc- amino acid 
<C) STRANQEDNESS : 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SCQ ID N0:43: 

Glu Lys Glu Arg Arg Phe Pro Thr lys Thr Ala Arg Ala Asp Pro Thr 
15 10 15 

Ihr Thr Lys Gin Leu He Tie Arg Ala Leu Gin Asn He Ser Leu Ala 
20 25 30 

Phe Gly He Glu Pro Ser Ser Thr Val Lys Tyr Ala Glu Ser T.nr Gin 
35 40 4b 

Glu Glu Asn Gly Lys Arg Ser Gin Ser Glu Ala Glu Glu Arc Ala Arg 
50 55 60 
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Arg GUj Ala Glu G"*u Arg Ala Arg Arg Glu Ala Glu Glu Arg Ala Gm 
65 7C 75 80 

Arg Glu Ala Glu Glu Arg Ala Gin Arg G1:j Ala Glu Glu Arg Ala Arg 
85 9D 95 

Arg Glu Ala Glu Lys Arg Ala Arg Arg Glu Ala Lys Glu Arg A;a Trp 
100 10= ll(i 

Gin G'iu Ala G'u Glu Arg Ai a 37 n Arg G")U Ala Glu Glu Arg Ala Arg 
115 120 12b 

Arg Glu Ala Glu Glu Arg Ala Arg Arg Glu Val G'u Glu Arg Ala Arg 
130 135 140 

Gin Glu Ala Glu Glj Leu Ala Arg G";n Glu Ser Glu Glu Arg Ala Arg 
I 4 * 150 155 160 

Gin Glu Ala Glu Glu Arg Ala Trp Gin Glu Ala Glu Glu A-g Ala Gin 
165 170 17b 

A-g Glu Ala Glu Glu Arg A'a Gin Arg Ala 
180 185 

<2) INFORMATION FOR SCO ID NO:44: 



(i) SFQULNCE CHARACTERISTICS: 

(A) LENGTH: 106 am-. no acids 
(0) lYPt: amino acid 
<C) STRANDEDNESS: 
(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEC 10 N0:44: 

Gly Arg Gly Arg Ala ! ys Ala Thr Asn Se~ Arg Cys Arg Arg Val A-g 
1 5 10 lb 

Gly Arg Ala Glu Ala Thr Ser Ser Arg Arg Arg Sc-r Gly Arg Gly Arg 
20 25 30 

Ala Lys Ala Thr Ser Ser Arg Cys Arg Arg Val trg Gly Arg Val Glu 
35 40 45 

Ala Thr Asn Ser Arg Cys Arg Arg Gly Arg Gly Arg Ala Lys Val Thr 
50 55 60 

Ser Ser Arg Xad Arg Arg Val Xaa Gly A-g Xaa Xaa Xaa Thr Ser Xaa 
65 70 75 80 

Arg Xaa Arg Arg Xaa Arg Gly Arg Xaa Xaa Val Thr Ser Arg Ary Xaa 
85 9C 9b 

Arg Arg Xaa Xaa Gly Arg Gly Asp Val Thr 
iOu 10b 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SCQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 amino acids 

(B) TYPE: amino acid 

(C) STRANQEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCF INSCRIPTION: SEQ ID NO: 45 

Ser He Pro Val Glu lie Asp He Arg Asn Gin Asp Phe Ser Phe Leu 
1 5 10 15 
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Asp Pro Ala Pro Glu Gly He Pro He Gin Asp He 4is Lej Met Gly 
20 25 3C 

Asp Ser Ala Phe Ala Ala Ser Ala Arg Glu Arg Met Lys I e.i Lys Arg 
35 40 4b 

Asn Pro Val Ala Asn Ala Ser Lys :le Ser Ala Leu Glu Glu Glu Met 
50 55 60 

Asp Gin Arg Ala His Val Leu Ale Lys Gin val Arg Asp Lys Gl^ Arg 
65 70 75 80 

Tnr Phe Leu Asp Pro Glu Pro Glu Gly Vol Pro _eu Glu Leu Leu Ser 
85 90 95 

leu Asn Glu Asn Glu Ala Ser Gin Glu Leu Glu Arg Glu Leu Arg Ala 
100 105 110 

Leu Asn Arg Lys Pro Arg Lys Aso Ala Lys Ala He Val Ala Leu Glu 
■ 15 120 125 

Asp Asp Val Arg Asp Clu His Thr Cys Leu Pro Arg Ser 
130 135 140 

(2) INFORMAL ION FOR SEQ 10 NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 27 aniino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 
V.:) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID K0:46: 

Arg Lys Met Ser Gly Thr Ser Leu Leu Ala Pro Gin °ro Glu Gly Vai 
1 5 10 15 



^^CT/U S96/1 8624 



Printed from Mimosa 01/07/09 1 1 :43:59 Page: 75 



WO 97/18475 



PC.T/LS96/ 18624 



74 



Pro Val Ser Glu Leu Ser Leu Asp leu Asp Glu 
20 ?5 

(2) INFORMATION FOR SFQ ID NO: 4/: 

(•;) SEQUENCE CHARACTERISTICS. 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi: SEQUENCE DESCRIPTION: SEO 10 NC:47: 

Leu Leu Ala Leu Leu Gin Gly Leu Val Gin Leu Arg Thr Gin Tie Mis 
1 5 10 15 

Gly Val Arg Pro Ala Leu Leu Pro Glu Scr Gly Gin Phe Leu Gly Gly 
20 25 30 

Ser Leu Gin Leu Ala Met His Leu Leu Ala Leu _eu Gin Gly Leu Val 
3r> 40 4b 

Gin Leu Ary Thr Gin lie his Gly Val Arg Pro Ala Leu Leu Pro Glu 
50 55 60 

Ser Gly Gin Phe Leu Gly Gly Ser Leu Gin Leu Ala Met His Leu Leu 
65 70 75 . 80 

Ala Leu Leu Gin Gly Leu V'a" Gin Leu A-g Thr Gin He H's Gly Val 
85 90 95 

A'-g Pro Ala Leu Leu Pro Glu Ser Sly Gin Phe Leu Gly Gly Ser Leu 
100 105 UC 
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G";n Leu Ala Thr H J s 
115 

(2) INFORMATION FOR StQ ID NO: 48: 

(i) SEQUENCE CHARACITRISTICS: 

(A) LENGTH: 11? amino Acids 

(B) TYPE; amino acid 

(C) STRAMDEDMESS : 
X) TCPOI.OGY: linear 

(xi) SEQUENCE DESCRIPTION: SLQ ID NO:48: 

5er Scr Arg Cys Cys Lys Ala Ser Ser Ser Cys Ala Arg Arg Phe Thr 
15 10 15 

Val Ser Ala Pro Leu Cys Ser Arg Arg Ala Ala Ser Ser Ser v'al Val 
20 25 30 

Arg Phe Ser Ser Arg Cys Thr Ser Ser Arg Cys Cys Lys Ala Ser Scr 
35 40 45 

Ser Cys Ala Arg Arc Phe Thr Val Ser Ala Pro Leu Cys Ser Arg Arg 
50 55 60 

Ala Gly Ser Scr Scr Val Val Arg Phe Ser Ser Arg Cys Th- Ser Ser 
65 70 75 80 

Arg Cys Cys Lys Ala Ser Ser Ser Cys Ala Arg Arg Phe Thr Val Ser 
85 90 95 

Ala Pro Leu Cys Ser Arg /rg Ala Gly Ser Ser Ser Val Val Arg Phe 
100 105 110 

Ser Ser Arg Arg Tnr 
115 
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(2) INFORMATION FOR SEQ ID NO: 49: 

( i) SEQUENCE ChARAf.TF R T ST 7 CS : 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) S1RANUEDNESS: 
(0) rO?OlOGY: 1-near 

(xi ) SFQIOCE DESCRIPTION: SEQ ID N0:49: 

Pro Pro Arg Ala Ala Ala Arg Pro Arg Pro Ala Ala His Ala Asp Se^ 
I 5 13 15 

Arg Cys Pro Pro Arg Ser Ala Pro Gly Glu Arg Pro Val Pro Arg Trp 
20 25 3C 

Phe Ala Ser Ala Arg Asp Ala Pro Pro Arg Ala Ala Ala Arg Pro Arg 
35 ^0 AS 

Pro Ala Ala His Ala Asp Ser Arg Cys Pro Pro A"g Ser A : a Pro Gl.y 
50 55 60 

Glu Arg Ala Val Pro Arg Trc Phe Ala Ser Ala Arg Asp Ala Pro Pro 
65 70 75 80 

Arg Ala Ala Ala Arg Pro Arg Pro Ala Ala His Ala Asp Ser Arg Cys 
fl5 90 gr> 

Pro Pro Arg Ser Ala Pro Gly Glu Arg Ala Val P'ro A"g Trp Phe Ala 
100 105 110 

Ser Ala Arg Asp Ala 
11S 
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(2) INFORMATION FOR SEC ID NO: 50: 

(1) SEQUENCE CHARAC T ERISTICS: 

(A) LENGTH: 207 amino acids 

(B) TYPE: amino acic 

(C) STRANDEDNFSS: 

(D) TOPOLOGY: linear 

<x1 5 SEQUENCE DESCRIPTION: SEQ ID N0:50: 

Ala Glu Pro Lys Pro Ala Glu Pro Lys Ser Ala Glu Pro Lys Pro Ala 
15 70 15 

Glu Pro Lys Ser Ala Glu Pro Lys Pro Ala Glu Pro Lys Ser Ala Glu 
20 25 30 

Pro Lys Pre Ala Glu Pro Lys Ser Ala Gly Pro Lys Pro Aid Glu Pro 
35 40 45 

Lys Ser Ala Glu Pro Lys Pro Ala Glu Pro Lys Ser Ala Glu Pro Lys 
50 55 60 

Pro Ala Glu Pro lys Ser Ala Glu Pro lys Pro Ala Glu Pro l ys Ser 
65 70 75 80 

Ala Glu Pro Lys Pro Ala Glu Ser Lys Ser Ala Glu Pro Lys Pro Ala 
85 90 95 

Glu Pro Lys Ser Ala Glu Pro Lys Pro Ala Glu Ser Lys Ser Ala Glu 
100 105 110 

Pro Lys Pro Ala Glu Pro lys Ser Ala G.u Pro Lys Pro Ala Glu Pro 
115 120 125 

Lys Ser Ala Glu Pro Lys Pro Ala Glu Pro lys Ser Ala Glu Pro Lys 
130 135 140 
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Pro Ala Glu P^o Lys Ser Ala Glu Pro Lys Pro Ala Glu Ser : ys Se*- 
145 150 155 160 

Ala Gly Pro Lys Pro Ala Glu Pro Lys Ser Ale Glu Pro Lys Pro Ala 
165 1/0 175 

Glu Pro Lys Ser Ala Glu Pro lys Pro Ala Glu Pro Lys Ser Ala Glu 
130 185 190 

Pro Lys Pro Ala Glu Pro Lys Ser Ala Glu Pro Lys Pro Ala Glu 
195 200 205 

(2) INFORMATION TOR SL~Q ID N0:51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 amino ac=ds 

(B) TYPE : amino acid 

(C) STRANDLDNtSS : 
CD) TOPOLOGY: linear 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Arg Arg Gly Tyr Pro Arg Ser Arg Met Pro Ser Lys G"u <_eu Trp Met 
1 b 10 15 

Arg Arg Leu Arg lie Leu Arg Arg Leu Leu Arg Lys Tyr Arg Glu Glu 
20 25 30 

lys lys He Asp Arg His lie Tyr Arg Glu Leu lyr val Lys Ala lys 
35 40 45 

Gly Asn Val Phe Arg Asn Lys Arg Asn Leu Met Glu His lie His Lys 
50 55 60 
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Val Lys Asn Glu Lys i_ys Lys GUj Arg Gin Leu Ale Glu Gin Leu Ala 

65 /0 /b 80 

Ala Asn Ala Xaa Lys Aso Glu Gin His Arg His Lys Ala Arg lys Gin 



90 



95 



Giu Leu Arg Lys Arg Glu Lys Asp A-g Glu Arg A"! a Arg Arg Glu Asp 
100 105 :io 

Ala Ala Ala Ala Ala Ala Ala Lys Gin Lys Ala Ala Ala Lys lys Ale 
US 120 125 

A" a Ala Pro Ser G;y Lys Lys Ser A* a Lys Ala Ala He Ala Pro Ala 
130 135 140 

Lys Ala Ala Ala Ala Pro Ala lys Ala Ala Ala Ala Pro Ala Lys Ala 
145 150 155 160 

Ala Ala Ala Pro Ala Lys Ala Ala Ala Ala Pro Ala Lys Ala Ala Ale 
165 170 175 

Ala 2 ro Ala Lys Ala Ala Thr Ala Pro Ala Lys Ala Ala Ala Ala Pro 
180 185 190 

Ala Lys Ihr Ala Ala Ala Pro Ala Lys Ala Ala Ala Pro Ala Lys Ala 
195 200 205 

Ala Ala Ala Pro Ala Lys Ala Ala Thr Ala Pro Ala Lys Ala Ala Ala 
210 215 220 

Ala Pro Ala Lys Ala Ala Thr Ala Pre Ala Lys Ala Ala Thr Ala Pro 
225 230 235 240 

Ala Lys Ala Ala Ala Ala Pro Ala Lys Ala Ala TV Ala Pro Val Gly 
245 250 255 



Printed from Mimosa 01/07/09 1 1:44:00 Page: 81 



WO 97/18475 PCT/IJS96/18624 

80 



Lys Lys Ala Gly Gly Lys Lys 
260 

(2) INFORMATION FOR SEQ 10 N0:52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 442 amino acids 
(R) TYPF: amino acid 
(C) STRAWDEDNESS : 
(J) TOPOLOGY: linear 

<xi) SEQUCNCL DESCRIPTION: SLQ ID N0:52: 

Asp Phe He Irp lyr Lys Val Val A~?a leu leu Val Val He Thr Ser 
1 5 ID lb 

Asn Gly Asp Asp Val Ser Val 'lyr Thr Ala Thr Tie Lys G'u Phe lyr 
20 25 30 

A r g Tyr Leu Trp He Phe Val Pro Val Ser leu Phe Ser He lie lie 
35 40 45 

Tyr Phe Val Ser ]le Phe Cys Phe Pro Ala Se^ ly Gly Leu Phe Phe 
50 5b 60 

Ser Ser Phe Leu Lys Phe Gin Leu Leu Leu Asn his Lys His Pro Val 
65 70 75 80 

Leu Gin Pro Pro His Gin Met. Val Ser Leu Lys Leu Gin Ala Arg Leu 
85 90 95 

Ala Ala Asp He leu Arg Cys Gly Arg his Arg Val Trp Leu Asp Pro 
100 105 110 

Asn Glu Ala Ser Glu He Ser Asn Ala Asn Ser Arg Lys Ser Val Arg 
115 12C 12b 
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Lys Leu lie Lys Asp Gly leu Tie He Arg lys Pro va 1 Lys Vol His 
130 135 140 

Ser Arg Ser Arg Trp Arg His Met Lys Glu Ala Lys Ser Met Gly Arg 
145 150 155 160 

His Glu Gly Ala Gly Arg Arg Glu Gly Thr Arg Glu Aid Arg Me~_ Pro 
165 170 175 

Ser Lys Glu Leu irp Met Arg Arg Leu Arg lie Leu Arg Arg Leu Leu 
180 185 190 

Arg lys Tyr Arg Glu Glu Lys Lys lie Asp Arg His He Tyr Arg Glu 
195 200 205 

Leu lyr Val Lys Ala Lys Gly Asn Val Phe Arg Asn Lys Arg Asn Leu 
210 215 220 

Met Glu His lie His Lys Val Lys Asn Glu Lys Lys Lys Glu Arg Gin 
225 230 235 240 

Leu Ala Glu Gin Leu Ala Ala Lys Arg Leu Lys Asp Glu Gin His Arg 
245 250 255 

His Lys Ala Arg lys Gin Glu Leu Arg Lys Arg Glu Lys Asp Arg Glu 
260 265 270 

Arg Aia Arg Arg Glu Asp Ala Ala Ala Ala Ala Ala Ala Lys G:n Lys 
275 280 285 

Ala Ala Ala Lys Lys Ala Ala Ala Pro Ser Gly Lys Lys Ser Ala '_ys 
290 295 300 

Ala Ala Ala Pro Ala Lys Ala Ala Ala Ala Pro Ala Lys Ala Ala Ala 
305 310 315 320 
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Pro Pro Aia Lys Thr Ala Ala Ala Pro Ala lys Ala Ala A;a Pro Ala 
32b 330 335 

lys Ala Ala Ala Pro Pro Ala Lys Ala Ala Ala a ro P-o Ala Lys Th-^ 
310 345 350 

Ala Ala Pro Pro Ala lys Thr Ala Ala Pro Pro Ala Lys Ala Ala Ala 
355 360 365 

Pro Pro Ala Lys Ala Ala A=a Pro Pro Aid Lys Ala Ala Ala Pro Pre 
370 375 380 

Ala lys Ala Ala Ala Ala Pro Ala Lys Ala Ala Ala Ala Pre Ala Lys 
385 390 395 40C 

Ala Ala Ala Pro Pro Ala lys Ala Ala Ala Pro Pro Ala lys Ala Ala 
405 410 415 

Ala Pro Pro Ala Lys Ala Ala Ala Pro Pro Ala lys Ala Ala Ala Ala 
420 425 430 

Pro Val Gly Lys Lys Ala G'y Gly Lys Lys 
435 440 

(2) 1NF0RMAIJUN FOR StQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 15 amino acids 
<B> TYPE: amino acid 
(C) S 1 RANDEDNESS : 
(H) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N3:53: 

Ala Ghj Pro lys Ser Ala G'iu Pro Lys Pro Alg Glu Pro Lys Ser 
15 10 15 

(?) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LFNGTH: 15 arcino acids 

(B) T V PE: amino acid 

(C) S7RANUEDNESS: 

(D) TOPOICGY linear 

(xi) SEQUENCE DESCRIPTION: SEQ 10 N0:54: 

Ala Glu Pro Lys Pro Ala Glu Pro Lys, Se<" Aid Glu Pro Lys Pre 
1 5 10 15 

(2) INFORMATION FOR SFQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANOEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5b: 

Lys Ala Ala He Ala Pro Ala Lys Ala Ala Ala Ala Pro Ala Lys Ala 
J 5 10 ]5 

Ala Tfr Ala Pro Ala 
20 
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(2) INFORMATION I- OR SEQ ID NO: 56: 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) S'IRANDLDNESS: 

(D) "I OTOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SbQ ID N0:56: 

Lys Ala Ala Ala Ala Pro Ala Lys Ala Ala Ala Ala Pro Ala Lys Ala 
1 5 in 15 

Ala Ala Ala Pro Ala 

20 

(2) INFORMATION FOR StQ ID NO:57: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LCNGTl: 22 amino acids 
(R) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPI10N: SEQ ID ND:b7: 

Gly Asp Lys Pro Scr Pro Phc Gly Gin Ala Ala Ala Gly Asp Lys Pre 
1 5 ■ 10 15 

Ser Pro Phe Gly Gin Ala 
20 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: :inenr 

(xi) SEQUENCE DESCRIPTION: SEQ II) N0:58: 

Lys Ala Ala Ala Ala Pro Ala Lys Ala Ala Ala Ala Pro Ala Lys Ala 
1 5 111 ib 

Ala Ala Ala Pro Ala 
20 

(2) INFORMATION FOR SEQ ID N0:59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) i rNGIII: 21 amine acids 

(B) TYRE: amino acid 

(C) STRANDEDNESS : 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SKQ ID N0:59: 

t.ys Ala Ala Thr Ala Pro Ala Lys Ala Ala Thr Ala Pro Ala lys Ala 
1 5 10 15 

Ala Thr Ala Pro Ala 
20 

(2) INFORMATION FOR SEQ ID NO: 60: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino adds 

(B) TYPE: amino acid 
CO STRANDEDNESS : 
CD) T0POLOG v : linear 
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(x1) SEQUENCE DCSCRIPTION: SEC ID NO: 60: 

Lys Aia Ala He Ala Pro A" a Lys Ala Ala He A" a Pro Ala Lys Ala 
1 5 10 15 

Ala He Ala Pro Ale 
20 

[2) INFORMATION FOR SEQ 10 N0:61: 

(i) SEQUENCE CHARACTERISTICS: 

CA; LENGTH: IA amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Lys Ala Ala Ala Ala Pro Ala Lys Ala Ala Ala Ala Pro Ala 
15 10 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) ( tNGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY : linear 

(xi) SLQUtNCb DESCRIPTION: SEQ ID N0:62: 

Lys Ala Ala Ala Ala Pro Ala Lys Ala Ala Thr Ala Pro Ala 
1 5 10 
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(2) INFORMATION FOR SEQ ID NO: 63: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 am no acids 

(B) "YPE : amino acid 

(C) STRANOFDNlSS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCF DESCRIPTION: SEC ID NO: 63: 

Gly Asp lys Pro Ser Pro Phe G'.y G:n Ala Ala Ala Gly Asp Lys Pro 
15 JO 15 

Ser Pro Phe Gly Gin Ala Gly Cys Gly Ser Ser .Met Pro Ser Gly Tnr 
20 25 30 

Ser Glu Glu Gly Ser Ara, Gly Gly Ser Ser Me:, ^o Ala Gly Cys Gly 
35 40 45 

Lys Ala Ala A'a Ala Pro Ala Lys A* a Ala Ala Ala Pre Ala Gly Cys 
50 55 60 

Gly A"! a Glu Pro Lys Ser Ala Glu Pre Lys P^o A"; a Glu Pro i_ys Ser 
65 70 75 8C 

Gly Cys Gly 



(2) INFORMATION- FOR SEQ ID NO: 64; 

<i> SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 amino acids 
. (B) TYPE: ammo acid 

(C) SI HANDEDNESS: 

(D) TOPOLOGY : linear 
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Cxi) SEQUENCE DESCRIPUON: SEC ID NU:64: 

Lys Thr Ala Ala Pro Pro Ala Lys Thr Ala Ala Pro Pro Ala Lys Thr 
1 5 10 15 

Ala Ala Pro Pro Ala 

20 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 618 amino acids 

(B) TYPE: amino acid 

(C) STRANDCDNESS : 

(D) TOPOLOGY: linear 

Cxi) SEQUENCE DESCRIPTION: SEQ ID N0:65: 

Lys Ala Val Asp Pro Phe Gin Gly Thr Thr Pro Pro Pro Tyr Lys Trp 
1 5 10 lb 

Gin Glu Mot Thr Gly Scr Glu Ala Ala Ala Gly Ser Leu Cys Val Pro 
20 25 30 

Ser !eu Ala Glu Val Ala Gly Gly Val Phe Ala Val Ala Glu Ala Glr 
35 40 45 

Arg Ser Glu Arg Asp Glu Ala Cys Gly His Ala Ala He Ala Thr Thr 
50 55 60 

His Tie Glu Thr Gly Gly Gly Gly Ser lys Ala He Se«* Ala Met Asp 
65 70 75 80 

Ala Gly Val Phe l.eu Val Glu leu Val Asp Ala Ala Ser Gly Th- He 
85 90 95 
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Arg Thr Arg G",u lys Met Gin Pro Thr Thr lie Val Ser Gly Asp Ihr 
100 105 110 

He Tyr Met Ala Leu Gly Asp Tyr Glu Lys Lys T hr Ser Gly Gly Arg 
lib 120 125 

Aid Aid Asp Ala Asp Gly Trp Arg Leu Leu Leu Met. Arg Gly Thr Leu 
130 135 140 

Thr Glu Asp Gly Gly Gin Lys Lys Tie Met "Irp Gly Asp lie Arq Ala 
145 150 155 15C 

Val Asp Pro Val Aid 21e Gly Leu Ihr Gin Phe Leu Lys Arg Val He 
165 170 175 

Gly Gly Gly Gly Ser Gly Val Val Ihr lys Asn Gly Tyr Leu Va' Leu 
160 195 190 

Pro Met Gin Ala Val Glu Lys Asp G"y A--g 5er Val Val Leu Se- Met 
195 200 205 

Arg P.ne Asn Met Arg He Glu Ala Cys Glu _eu Ssr Ser Gly Thr Thr 
210 215 220 

Gly Ser Asr Cys Lys Glu Pro Ser He Ala Asn Leu Glu Gly Asn Leu 
225 230 235 240 

lie Leu lie Thr Ser Cys Ala Ala Gly Tyr Tyr Glu Val 3 he Arg Ser 
245 250 255 

Leu Asp Ser Gly Thr Ser T<~p Glu Met Se- Gly Arg Pro He Ser Arg 
260 265 270 

Val Irp Gly Asn Ser Tyr Gly Arg Lys Gly lyr Gly Val Arg Cys Gly 
275 280 2B5 
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Leu Thr Thr Vel Thr He Glu Gly Arg Glu Val l eu Leu Val I hr Thr 
290 29b 30C 

Pro Val Tyr Leu Glu Glu lys Asn Gly Arg Gly Arg Leu His I eu Trp 
305 310 315 320 

Val I hr Asp Gly Ala Arg Val His Asp Ala Gly Pre He Ser Asp Ala 
325 330 335 

Ala Asp Asp Ala Ala Ala Ser Ser Leu Leu Tyr Ser Ser Gly Gly Asn 
340 315 350 

Leu He Ser Leu Tyr Glu Asn Lys Ser Glu Gly Ser Tyr Gly :.eu Val 
355 360 365 

Ala Val His Val Ihr Thr Gin Leu Glu Arg He Lys Thr Val '_eu Lys 
370 375 380 

Arg Trp Gin Glu Lei; Asp Glu Ala Leu Arc Thr Cys Arg Ser Thr Ala 
385 390 395 4C0 

Thr He Asp Pro Val Arg Arg Gly Met Cys He Arg Pro He leu Tnr 
405 410 415 

Asp Gly t .eu Val Gly Tyr Leu Ser Gly Leu Ser Ihr Gly Ser Glu Irp 
420 42b 430 

Met Asp Glu Tyr Leu Cys Val Asn Ala Thr Val His Gly Thr Val Arg 
435 440 445 

Gly Phe Ser Asn Gly Val Thr Phe Glu Gly Pro Gly Ala Gly Ala Gly 
450 455 460 

Trp Pro Val Ala Arg Ser G*y Gin Asn Gin Pro Tyr His Phe Leu His 
465 470 475 480 
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Lys Thr Phe Thr I eu Vel Val Met Ala Val lie Ms Asp Arg °ro lys 
485 490 495 

Lys Arg Th^ Pro He Pro Leu He Arg Val Val Met Asp Asp Asn Asp 
500 5C5 510 

Lys Thr Val Leu Phe Gly Val Phe Tyr Thr His Asp Gly Arg Trp Met. 
515 520 525 

Thr Val He His Ser Gly Gly Arg Gin He Leu Ser Thr Gly Tro Asp 
530 53b 540 

Pro Glu Lys Pro Cys Gin Val Val Leu Arg His Asp Thr Gly His Trp 
'345 550 555 560 

Asp Pnc Tyr Val Asn Ala Arg lys Ala Tyr Phe Gly Ihr lyr L ys Gly 
565 570 575 

Leu Phe Ser Lys Gin Thr Val Phe his Thr Ser Asn Ser Thr Gly Arg 
580 585 590 

Val Gly Lys Leu Gin Ser Pro Ala lie Cys His Ser Ser Thr P-o Val 
595 6C0 605 

Cys lie Thr Glu Asp Ser He Pro Ser He 
610 615 
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Claims 

1. A method for delecting T. cruzi infection in a biological sample. 

comprising: 

(a) contacting the biological sample with a polypeptide comprising 
an epitope of a T. cruzi antigen having an amino acid sequence encoded by a nucleotide 
sequence recited in SEQ ID NO:l - SEQ ID NO:22, or a variant of said antigen that differs 
only in conservative substitutions and/or modifications; and 

(b) detecting in the biological sample the presence of antibodies 
that bind to the polypeptide, therefrom detecting T. crazi infection in the biological sample. 

2. The method of claim I wherein the biological sample is selected from 
the group consisting of blood, serum, plasma, saliva, cerebrospinal fluid and urine. 

3. The method of claim 1 wherein the polypeptide is bound to a solid 

support. 

4. The method of claim 3 wherein the solid support comprises 
nitrocellulose, latex or a plastic material. 

5. The method of claim 3 wherein the step of detecting comprises: 

(a) removing unbound sample from the solid support; 

(b) adding a detection reagent to the solid support; and 

(c) determining the level of detection reagent bound to the solid 
support, relative to a predetermined cutoff value, and therefrom detecting 71 cruzi infection in 
the biological sample. 

6. The method of claim 5 wherein the detection reagent comprises a 
reporter group conjugated to a binding agent. 

7. The method of claim 6 wherein the binding agent is selected from the 
group consisting of anti-immunoglobulin, Protein G, Protein A and lectins. 

8* The method of claim 6 wherein the reporter group is selected from the 
group consisting of radioisotopes, fluorescent groups, luminescent groups, enzymes, biotin 
and dye particles. 
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9, A polypeptide comprising an epitope of a T. vruzi antigen having an 
amino acid sequence encoded by a nucleotide sequence recited in SEQ ID NO: I - SEQ ID 
NO:21, or a variant of said antigen that differs only in conservative substitutions and/or 
modifications. 

10. An isolated DNA sequence encoding a polypeptide according to claim 

9. 

3 1. A recombinant expression vector comprising a DNA sequence 
according to claim 10. 

12. A host cell transformed or transfected with an expression vector 
according to claim 11. 

13. The host cell of claim 12 wherein the host cell is selected from the 
group consisting of E. call, yeast, insect cell lines and mammalian cell lines. 

14. A diagnostic kit for detecting T, cruzi infection in a biological sample. 

comprising: 

(a) a polypeptide comprising an epitope of a T. cruzi antigen 
having an amino acid sequence encoded by a nucleotide sequence recited in SEQ ID NO:l - 
SEQ ID NO:22, or a variant of said antigen that differs only in conservative substitutions 
and/or modifications: and 

(b) a detection reagent. 

15. The kit of claim 14 wherein the polypeptide is bound to a solid 

support. 

16. The kit of claim 15 wherein the solid support comprises nitrocellulose, 
latex or a plastic material. 

1 7. The kit of claim 14 wherein the detection reagent comprises a reportcr 
group conjugated to a binding agent. 

18. The kit of claim 17 wherein the binding agent is selected from the 
group consisting of antiimmunoglobulin, Protein G t Protein A and lectins. 
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19. The kit of claim 17 wherein the reporter group is selected from ihc 
group consisting of radioisotopes, fluorescent groups, luminescent groups, enzymes, biotin 
and dye particles. 

20. A method for detecting T. cruzi infection in a biological sample, 

comprising: 

(a) contacting a biological sample with a monoclonal antibody that 
binds to an epitope of a 7" cruzi antigen having an amino acid sequence encoded by a 
nucleotide sequence recited in SEQ ID NO:l - SEQ ID NO:22, or a variant of said antigen 
that differs only in conservative substitutions and/or modifications; and 

(b) detecting in the biological sample the presence of 7* criAzi 
parasites that bind to the monoclonal antibody, therefrom detecting /! cruzi infection in the 
biological sample. 

21. The method of claim 20. wherein the monoclonal antibody is bound to 
a solid support. 

22. The method of claim 21 wherein the step of detecting comprises: 

(a) removing unbound sample from the solid support; 

(b) adding a detection reagent to the solid support; and 

(c) deiermining the level of detection reagent bound lo the solid support, 
relative to a predetermined cutoff value, therefrom detecting T, cruzi infection in the 
biological sample. 

23. The method of claim 22 wherein the detection reagent comprises a 
reporter group coupled to an antibody. 

24. A pharmaceutical composition comprising: 

(a) a polypeptide, wherein the polypeptide comprises an epitope of a 
T. cruzi antigen having an amino acid sequence encoded by a nucleotide sequence recited in 
SEQ ID NO:l - SEQ ID NO:22, or a variant of said antigen thai differs only in conservative 
substitutions and/or modifications; and 

(b) a physiological ly acceptable carrier. 

25. A vaccine for stimulating the production of antibodies that bind to 
cruzi, comprising: 
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(a) a polypeptide* wherein the polypeptide comprises an epitope of a 
T cruzi antigen having an amino acid sequence encoded by a nucleotide sequence recited in 
SEQ ID NO:l - SEQ ID NO:22. or a variant of said antigen that differs only in conservative 
substitutions and/or modifications; and 

(b) an adjuvant. 

26. A method for inducing protective immunity against Chagas 1 disease in 
a patient, comprising administering to a patient a pharmaceutical composition according to 
claim 24. 

27. A method for inducing protective immunity against Chagas* disease in 
a patient, comprising administering to a patient a vaccine according to claim 25 

28. A method for detecting T. cruzi infection in a biological sample, 

comprising: 

(a) contacting the biological sample with a first polypeptide 
comprising an epitope of a T. cruzi antigen having an amino acid sequence encoded by a 
nucleotide sequence recited in SEQ ID NO:l - SEQ ID NO:22* or a variant of said antigen 
that differs only in conservative substitutions and/or modifications; 

(b) contacting the biological sample with a second polypeptide 
comprising an epitope of TcD. or a variant thereof that differs only in conservative 
substitutions and/or modifications; and 

(c) detecting in the biological sample the presence of antibodies 
that bind to one or more of said polypeptides, therefrom detecting T. cruzi infection in the 
biological sample. 

29. A method for detecting T. cruzi infection in a biological sample. 

comprising: 

(a) contacting the biological sample with a first polypeptide 
comprising an epitope of a T. cruzi antigen having an amino acid sequence encoded by a 
nucleotide sequence recited in SEQ ID NO:l - SEQ ID NO:22. or a variant of said antigen 
that differs only in conservative substitutions and/or modifications; 

(b) contacting the biological sample with a second polypeptide 
comprising an epitope of TcD, or a variant thereof that differs only jn conservative 
substitutions and/or modifications; 
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(c) contacting the biological sample with a third polypeptide 
comprising an epitope of TcE. or a variant thereof that differs only in conservative 
substitutions and/or modifications; and 

(d) detecting in the biological sample the presence of antibodies 
that bind to one or more of said polypeptides, therefrom detecting T. cruzi infection in the 
biological sample. 

30- A method for detecting 1\ cruzi infection in a biological sample, 

comprising: 

(a) contacting the biological sample with a first polypeptide 
comprising an epitope of a T. cruzi antigen having an amino acid sequence encoded by a 
nucleotide sequence recited in SKQ ID NO:l - SKQ ID NO:22. or a variant of said antigen 
that differs only in conservative substitutions and/or modifications; 

(b) contacting the biological sample with a second polypeptide 
comprising an epitope of TcD, or a variant thereof that differs only in conservative 
substitutions and/or modifications; 

(c) contacting the biological sample with a third polypeptide 
comprising an epitope of PEP-2, or a variant thereof that differs only in conservative 
substitutions and/or modifications; and 

(d) detecting in the biological sample the presence of antibodies 
that bind to one or more of said polypeptides, therefrom detecting T. cruzi infection in the 
biological sample. 

31. A method for detecting T. cruzi infection in a biological sample. 

comprising: 

(a) contacting the biological sample with a first polypeptide 
comprising an epitope of a T. cruzi antigen having an amino acid sequence encoded by a 
nucleotide sequence recited in SEQ ID NO:l - SKQ ID NO:22, or a variant of said antigen 
that differs only in conservative substitutions and/or modifications: 

(b) contacting the biological sample with a second polypeptide 
comprising an epitope of TcE. or a variant thereof that differs only in conservative 
substitutions and/or modifications; and 

(c) detecting in the biological sample the presence of antibodies 
that bind to one or more of said polypeptides, therefrom detecting T. cruzi infection in the 
biological sample. 
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32. The method of claim 31 wherein the second polypeptide comprises the 
amino acid sequence Lys Ala Ala Ala Ala Pro Ala Lys Ala Ala Ala Ala Pro Ala Lys Ala Ala 
Ala Ala Pro Ala (SEQ ID NO:56). 

33. A combination polypeptide comprising two or more polypeptides 
according to claim 9. 

34. A combination polypeptide comprising at least one epitope of a T. 
cruzi antigen having an amino acid sequence encoded by a nucleotide sequence recited in 
SEQ ID NO: 1 - SEQ ID NO:22. or a varianl of said antigen that differs only in conservative 
substitutions and/or modifications, and at least one epitope selected from the group consisting 
of epitopes of TcD, epitopes of TcE, epitopes of PEP-2 and variants thereof that differ only in 
conservative substitutions and/or modifications. 

35. A combination polypeptide according to claim 34, wherein the epitope 
selected from the group consisting of epitopes of TcD, epitopes of TcE, epitopes of PEP-2 
and variants thereof that differ only in conservative substitutions and/or modifications has an 
amino acid sequence recited in SEQ ID NO:55-56. 

36. A combination polypeptide according to claim 34. wherein the epitope 
selected from the group consisting of epitopes of TcD, epitopes of TcE, epitopes of PEP-2 
and variants thereof that differ only in conservative substitutions and/or modifications has an 
amino acid sequence recited in SEQ ID NO;53-54. 

37. A combination polypeptide according to claim wherein the epitope 
selected from the group consisting of epitopes of TcD, epitopes of TcK. epitopes of PEP-2 
and variants thereof that differ only in conservative substitutions and/or modifications has an 
amino acid sequence recited in SEQ ID NO: 5 7. 

38. A method for detecting T cruzi infection in a biological sample, 

comprising: 

(a) contacting the biological sample with a combination 
polypeptide according to any one of claims 33-37; and 

fb) detecting in the biological sample the presence of antibodies 
that bind to the combination polypeptide, therefrom detecting 7". cruzi infection in the 
biological sample. 
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